Estimating probability density of content types for promoting medical records search

  • Yun He
  • , Qinmin Hu*
  • , Yang Song
  • , Liang He
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Disease and symptom in medical records tend to appear in different content types: positive, negative, family history and the others. Traditional information retrieval systems depending on keyword match are often adversely affected by the content types. In this paper, we propose a novel learning approach utilizing the content types as features to improve the medical records search. Particularly, the different contents from the medical records are identified using a Bayesian-based classification method. Then, we introduce our type-based weighting function to take advantage of the content types, in which the weights of the content types are automatically calculated by estimating the probability density functions in the documents. Finally, we evaluate the approach on the TREC 2011 and 2012 Medical Records data sets, in which our experimental results show that our approach is promising and superior.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 38th European Conference on IR Research, ECIR 2016, Proceedings
EditorsMarie-Francine Moens, Nicola Ferro, Gianmaria Silvello, Giorgio Maria di Nunzio, Claudia Hauff, Fabio Crestani, Josiane Mothe, Fabrizio Silvestri
PublisherSpringer Verlag
Pages252-263
Number of pages12
ISBN (Print)9783319306704
DOIs
StatePublished - 2016
Event38th European Conference on Information Retrieval Research, ECIR 2016 - Padua, Italy
Duration: 20 Mar 201623 Mar 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9626
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference38th European Conference on Information Retrieval Research, ECIR 2016
Country/TerritoryItaly
CityPadua
Period20/03/1623/03/16

Keywords

  • Content types identification
  • Density estimation
  • Medical records search
  • Weighting function

Fingerprint

Dive into the research topics of 'Estimating probability density of content types for promoting medical records search'. Together they form a unique fingerprint.

Cite this