Modeling queries with contextual snippets for information retrieval

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Query expansion under the pseudo-relevance feedback (PRF) framework has been extensively studied in information retrieval. However, most expansion methods are mainly based on the statistics of single terms, which can generate plenty of irrelevant query terms and decrease retrieval performance. To alleviate this problem, we propose an approach that adapts the PRF-based contextual snippets into a context-aware topic model to enhance query representations. Specifically, instead of selecting a series of independent terms, we make full use of the query contextual information and focus on the snippets with the length of n in the PRF documents. Furthermore, we propose a context-aware topic (CAT) model to mine the topic distributions of the query-relevant snippets, namely, fine contextual snippets. In contrast to the traditional topic models that infer the topics from the whole corpus, we establish a bridge between the snippets and the corresponding PRF documents, which can be used for modeling the topics more precisely and efficiently. Finally, the topic distributions of the fine snippets are used for context-aware and topic-sensitive query representations. To evaluate the performance of our approach, we integrate the obtained queries into a topic-based hybrid retrieval model and conduct extensive experiments on various TREC collections. The experimental results show that our query-modeling approach is more effective in boosting retrieval performance compared with the state-of-the-art methods.

Original languageEnglish
Article number47
JournalACM Transactions on Intelligent Systems and Technology
Volume9
Issue number4
DOIs
StatePublished - Jan 2018

Keywords

  • Contextual snippet
  • Query representation
  • Topic modeling

Fingerprint

Dive into the research topics of 'Modeling queries with contextual snippets for information retrieval'. Together they form a unique fingerprint.

Cite this