A Web-based System for Retrieving Document Images from Digital Library

  • Li Zhang
  • Yue Lu
  • Chew Lim Tan
  • National University of Singapore

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

This paper describes a web-based system for retrieving imaged documents from a digital library. First, each imaged document is preprocessed off-line to extract its word objects. Each word object is then represented by a string known as its feature code, and the feature codes of a document are collected into a feature code file for that document. On the web interface side, the user enters a set of query words and chooses whether to combine them with an "AND" or an "OR" operation. On receiving the user's request, the system processes each query word and combines the per-word results according to the chosen operation. Each query word is first looked up in an index table that stores previously queried words. If a match is found, the results are retrieved directly from the index table and stored temporarily for subsequent merging. This speeds up searching and makes the system an incremental intelligence system. Otherwise, the system converts the query word to a feature code string and uses a partial word matching approach to search the pre-generated feature code files. Preliminary experimental results on imaged student theses provided by our digital library show that the proposed system is efficient and promising for document image retrieval, and thus has potential applications in digital libraries.
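
The query flow described in the abstract (index table lookup, fallback to feature code matching, then "AND"/"OR" merging) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `feature_code` function, the `Retriever` class, and the substring test used for partial matching are all hypothetical stand-ins, since the abstract does not specify how feature codes are built or compared. Here each letter is simply mapped to a coarse shape class, and the codes are derived from text rather than from word images.

```python
from typing import Dict, List, Set

# ASSUMPTION: a toy feature code, not the one used in the paper.
# Each letter maps to a coarse shape class:
#   a = ascender, d = descender, x = x-height only.
ASCENDERS = set("bdfhklt")
DESCENDERS = set("gjpqy")


def feature_code(word: str) -> str:
    """Convert a word to a hypothetical feature code string."""
    code = []
    for ch in word.lower():
        if ch in ASCENDERS:
            code.append("a")
        elif ch in DESCENDERS:
            code.append("d")
        else:
            code.append("x")
    return "".join(code)


class Retriever:
    """Hypothetical retriever: index table first, feature code search second."""

    def __init__(self, code_files: Dict[str, List[str]]):
        # code_files maps a document id to the feature codes of its word objects
        # (the result of the off-line preprocessing step).
        self.code_files = code_files
        # Index table of previously queried words and their matching documents.
        self.index_table: Dict[str, Set[str]] = {}

    def search_word(self, word: str) -> Set[str]:
        key = word.lower()
        # Words queried before are answered directly from the index table.
        if key in self.index_table:
            return set(self.index_table[key])
        # Otherwise convert the word to a feature code and scan the code files.
        # ASSUMPTION: partial matching is approximated by a substring test.
        query_code = feature_code(key)
        hits = {
            doc_id
            for doc_id, codes in self.code_files.items()
            if any(query_code in code for code in codes)
        }
        # Remember the result so the next identical query skips the scan.
        self.index_table[key] = hits
        return set(hits)

    def query(self, words: List[str], op: str = "AND") -> Set[str]:
        results = [self.search_word(w) for w in words]
        if not results:
            return set()
        if op == "AND":
            return set.intersection(*results)
        if op == "OR":
            return set.union(*results)
        raise ValueError("op must be 'AND' or 'OR'")


# Toy corpus: codes are computed from text here purely for illustration.
code_files = {
    "thesis_1": [feature_code(w) for w in ["digital", "library"]],
    "thesis_2": [feature_code(w) for w in ["image", "retrieval"]],
}
retriever = Retriever(code_files)
and_hits = retriever.query(["digital", "library"], op="AND")
or_hits = retriever.query(["digital", "image"], op="OR")
```

With this toy corpus, the "AND" query matches only `thesis_1`, and the "OR" query matches both documents. Because the index table grows with every new query word, repeated queries avoid rescanning the feature code files, which is the incremental behaviour the abstract refers to. A code this coarse would collide on many distinct words, so a real system needs a far more discriminative representation.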

Original language: English
Title of host publication: 2003 Conference on Computer Vision and Pattern Recognition Workshop, CVPRW 2003
Publisher: IEEE Computer Society
Pages: 27-34
Number of pages: 8
ISBN (Electronic): 0769519008
DOI
Publication status: Published - 2003
Externally published
Event: Conference on Computer Vision and Pattern Recognition Workshop, CVPRW 2003 - Madison, United States
Duration: 16 Jun 2003 to 22 Jun 2003

Publication series

Name: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Volume: 3
ISSN (Print): 2160-7508
ISSN (Electronic): 2160-7516

Conference

Conference: Conference on Computer Vision and Pattern Recognition Workshop, CVPRW 2003
Country/Territory: United States
City: Madison
Period: 16/06/03 to 22/06/03
