Skip to main navigation Skip to search Skip to main content

Word searching in document images using word portion matching

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

An approach with the capability of searching a word portion in document images is proposed in this paper, to facilitate the detection and location of the user-specified query words. A feature string is synthesized according to the character sequence in the user-specified word, and each word image extracted from documents are represented by a feature string. Then, an inexact string matching technology is utilized to measure the similarity between the two feature strings, based on which we can estimate how the document word image is relevant to the user-specified word and decide whether its portion is the same as the user-specified word. Experimental results on real document images show that it is a promising approach, which is capable of detecting and locating the document words that entirely match or partially match with the user-specified word.

Original languageEnglish
Title of host publicationDocument Analysis Systems V - 5th International Workshop, DAS 2002, Proceedings
EditorsDaniel Lopresti, Jianying Hu, Ramanujan Kashi
PublisherSpringer Verlag
Pages319-328
Number of pages10
ISBN (Print)3540440682, 9783540440680
DOIs
StatePublished - 2002
Externally publishedYes
Event5th International Workshop on Document Analysis Systems, DAS 2002 - Princeton, United States
Duration: 19 Aug 200221 Aug 2002

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2423
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Workshop on Document Analysis Systems, DAS 2002
Country/TerritoryUnited States
CityPrinceton
Period19/08/0221/08/02

Fingerprint

Dive into the research topics of 'Word searching in document images using word portion matching'. Together they form a unique fingerprint.

Cite this