Enhancing the healthcare retrieval with a self-adaptive saturated density function

  • Yang Song*
  • , Wenxin Hu
  • , Liang He
  • , Liang Dou
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

The proximity based information retrieval models usually use the same pre-define density function for all of terms in the collection to estimate their influence distribution. In healthcare domain, however, different terms in the same document have different influence distributions, the same term in different documents also has different influence distributions, and the pre-defined density function may not completely match the terms’ actual influence distributions. In this paper, we define a saturated density function to measure the best suitable density function that fits the given term’s influence distribution, and propose a self-adaptive approach on saturated density function building for each term in various circumstance. Particularly, our approach utilizing Gamma process is an unsupervised model with no requirements for external resources. Then, we construct a density based weighting method for the purpose of evaluating the effectiveness of our approach. Finally, we conduct our experiment on five standard CLEF and TREC datasets, and the experimental results show that our approach is promising and outperforms the pre-defined density functions in healthcare retrieval.

Original languageEnglish
Title of host publicationAdvances in Knowledge Discovery and Data Mining - 23rd Pacific-Asia Conference, PAKDD 2019, Proceedings
EditorsZhiguo Gong, Min-Ling Zhang, Zhi-Hua Zhou, Qiang Yang, Sheng-Jun Huang
PublisherSpringer Verlag
Pages501-513
Number of pages13
ISBN (Print)9783030161477
DOIs
StatePublished - 2019
Event23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2019 - Macau, China
Duration: 14 Apr 201917 Apr 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11439 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2019
Country/TerritoryChina
CityMacau
Period14/04/1917/04/19

Keywords

  • Information retrieval
  • Saturated density function
  • Self-adaptive

Fingerprint

Dive into the research topics of 'Enhancing the healthcare retrieval with a self-adaptive saturated density function'. Together they form a unique fingerprint.

Cite this