跳到主要导航 跳到搜索 跳到主要内容

Optimizing top-K Retrieval: Submodularity analysis and search strategies

  • Fudan University
  • East China Normal University
  • University of London

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The key issue in top-k retrieval - finding a set of k documents (from a large document collection) that can best answer a user's query - is to strike the optimal balance between relevance and diversity. In this paper, we study the top-k retrieval problem in the framework of facility location analysis and prove the submodularity of that objective function which provides a theoretical approximation guarantee of factor for the (best-first) greedy search algorithm. Furthermore, we propose a two-stage hybrid search strategy which first obtains a high-quality initial set of top-k documents via greedy search, and then refines that result set iteratively via local search. Experiments on two large TREC benchmark datasets show that our two-stage hybrid search strategy approach outperforms the existing ones.

源语言英语
主期刊名Web-Age Information Management - 15th International Conference, WAIM 2014, Proceedings
出版商Springer Verlag
18-29
页数12
ISBN(印刷版)9783319080093
DOI
出版状态已出版 - 2014
活动15th International Conference on Web-Age Information Management, WAIM 2014 - Macau, 中国
期限: 16 6月 201418 6月 2014

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
8485 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议15th International Conference on Web-Age Information Management, WAIM 2014
国家/地区中国
Macau
时期16/06/1418/06/14

指纹

探究 'Optimizing top-K Retrieval: Submodularity analysis and search strategies' 的科研主题。它们共同构成独一无二的指纹。

引用此