跳到主要导航 跳到搜索 跳到主要内容

Adjacency matrix based full-text indexing models

  • Shuigeng Zhou
  • , Jihong Guan
  • , Yunfa Hu
  • , Jiangtao Hu
  • , Aoying Zhou
  • Wuhan University
  • Fudan University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper proposes two new character-based full-text indexing models, i.e., adjacency matrix based inverted file and adjacency matrix based PAT array. Formally, the former is a kind of reorganization of the traditional inverted file, and the latter is a kind of decomposition of the traditional PAT array. Both organize text-indexing information in the form of adjacency matrix. Query algorithms for the new models are developed and performance comparisons between the new models and the traditional models are carried out. The new models can improve query-processing efficiency considerably at the cost of much less amount of extra storage overhead compared to the size of original text database, so are suitable for applications of large-scale text databases, especially Chinese text databases.

源语言英语
主期刊名Advances in Web-Age Information Management - 2nd International Conference, WAIM 2001, Proceedings
编辑X. Sean Wang, Ge Yu, Hongjun Lu
出版商Springer Verlag
60-71
页数12
ISBN(印刷版)9783540477143
DOI
出版状态已出版 - 2001
已对外发布
活动2nd International Conference on Web-Age Information Management, WAIM 2001 - Xi’an, 中国
期限: 9 7月 200111 7月 2001

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2118
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议2nd International Conference on Web-Age Information Management, WAIM 2001
国家/地区中国
Xi’an
时期9/07/0111/07/01

指纹

探究 'Adjacency matrix based full-text indexing models' 的科研主题。它们共同构成独一无二的指纹。

引用此