Adjacency matrix based full-text indexing models

  • Shuigeng Zhou
  • , Jihong Guan
  • , Yunfa Hu
  • , Jiangtao Hu
  • , Aoying Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes two new character-based full-text indexing models, i.e., adjacency matrix based inverted file and adjacency matrix based PAT array. Formally, the former is a kind of reorganization of the traditional inverted file, and the latter is a kind of decomposition of the traditional PAT array. Both organize text-indexing information in the form of adjacency matrix. Query algorithms for the new models are developed and performance comparisons between the new models and the traditional models are carried out. The new models can improve query-processing efficiency considerably at the cost of much less amount of extra storage overhead compared to the size of original text database, so are suitable for applications of large-scale text databases, especially Chinese text databases.

Original languageEnglish
Title of host publicationAdvances in Web-Age Information Management - 2nd International Conference, WAIM 2001, Proceedings
EditorsX. Sean Wang, Ge Yu, Hongjun Lu
PublisherSpringer Verlag
Pages60-71
Number of pages12
ISBN (Print)9783540477143
DOIs
StatePublished - 2001
Externally publishedYes
Event2nd International Conference on Web-Age Information Management, WAIM 2001 - Xi’an, China
Duration: 9 Jul 200111 Jul 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2118
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd International Conference on Web-Age Information Management, WAIM 2001
Country/TerritoryChina
CityXi’an
Period9/07/0111/07/01

Fingerprint

Dive into the research topics of 'Adjacency matrix based full-text indexing models'. Together they form a unique fingerprint.

Cite this