An approach to matching partial word image and its application to document image retrieval

Yue Lu, Chew Lim Tan, Lin Lin

Research output: Contribution to journalConference articlepeer-review

Abstract

An approach with the capability of matching partial word image is proposed in this paper, to facilitate the issues of document image retrieval, such as detection of user-specified query words, and similarity measurement between documents. Each word image is represented by a feature string. Then, an inexact string matching technology is utilized to measure the similarity between the two feature strings generated from two word images, based on which we can estimate how one word image is relevant to the other one and thereby decide whether one is a portion of the other word. The approach is applied to two issues in the area of document information retrieval: word spotting and document similarity measurement. Experimental results on real document images show that it is a promising approach.

Original languageEnglish
Pages (from-to)379-387
Number of pages9
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume4929
DOIs
StatePublished - 16 Sep 2002
Externally publishedYes
EventOptical Information Processing Technology 2002 - Shanghai, China
Duration: 14 Oct 200218 Oct 2002

Keywords

  • Document image analysis
  • Document similarity measurement
  • Inexact matching
  • Information retrieval
  • Word image matching
  • Word spotting

Fingerprint

Dive into the research topics of 'An approach to matching partial word image and its application to document image retrieval'. Together they form a unique fingerprint.

Cite this