Run-based approach to labeling connected components in document images

  • Xiao Tu*
  • , Yue Lu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

A fast algorithm is proposed in this paper to label connected components in binary document images. Runs are extracted from the image row by row. The positional relations among the runs of current rows and the runs of their preceding rows are represented utilizing trees, where each tree corresponds to a connected component. Only one-pass scan is required for the proposed approach to obtain the characteristics of the connected components, such as bounding rectangle, area, number of pixels. It is thus a fast and effective algorithm. Experimental results have shown that the efficiency of the present algorithm is superior to that of the conventional algorithms in terms of computational speed.

Original languageEnglish
Title of host publication2nd International Workshop on Education Technology and Computer Science, ETCS 2010
Pages206-209
Number of pages4
DOIs
StatePublished - 2010
Event2nd International Workshop on Education Technology and Computer Science, ETCS 2010 - Wuhan, Hubei, China
Duration: 6 Mar 20107 Mar 2010

Publication series

Name2nd International Workshop on Education Technology and Computer Science, ETCS 2010
Volume2

Conference

Conference2nd International Workshop on Education Technology and Computer Science, ETCS 2010
Country/TerritoryChina
CityWuhan, Hubei
Period6/03/107/03/10

Keywords

  • Connected component
  • Document image analysis
  • Run-based
  • Tree

Fingerprint

Dive into the research topics of 'Run-based approach to labeling connected components in document images'. Together they form a unique fingerprint.

Cite this