Word detecting in document image based on two-stage model

  • Xiujuan Li*
  • , Zhimin Huang
  • , Ying Wen
  • , Yue Lu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a word detecting method for document image using character models and word models to evaluate the features of single-character and between-character. First, the text line is segmented into several fragments. Second, the candidate character, which is generated by merging some consecutive fragments, will be identified to be the right one if it conforms to the query word character models. Third, the path search strategy is used to search the candidate words constructed with candidate characters. The word model is used to identify the matching cost. Our experimental results on a dataset of document images demonstrate the effectiveness of the proposed method.

Original languageEnglish
Title of host publicationAdvances on Digital Television and Wireless Multimedia Communications - 9th International Forum on Digital TV and Wireless Multimedia Communication, IFTC 2012, Proceedings
Pages175-181
Number of pages7
DOIs
StatePublished - 2012
Event9th International Forum on Digital TV and Wireless Multimedia Communication, IFTC 2012 - Shanghai, China
Duration: 9 Nov 201210 Nov 2012

Publication series

NameCommunications in Computer and Information Science
Volume331 CCI
ISSN (Print)1865-0929

Conference

Conference9th International Forum on Digital TV and Wireless Multimedia Communication, IFTC 2012
Country/TerritoryChina
CityShanghai
Period9/11/1210/11/12

Keywords

  • Character Model
  • Word Detecting
  • Word Model

Fingerprint

Dive into the research topics of 'Word detecting in document image based on two-stage model'. Together they form a unique fingerprint.

Cite this