Handwritten and machine printed text discrimination using an edge co-occurrence matrix

  • Xiaofeng Zhang*
  • , Yue Lu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

We employ an edge co-occurrence matrix (ECM) to distinguish handwritten and machine printed text without resorting to line or word information. The ECM is a modified co-occurrence matrix (CM) on edge images. First, the whole image is divided into overlapping range blocks with fixed size. Then, the ECMs are abstracted from these blocks. The ECM only counts the co-occurring edges connected with each other and its up direction part is the part with most distribution. The liner Support Vector Machine (SVM) is used to classify the features. Because of the similarities of neighboring blocks, the Discriminative Random Fields (DRF) is used to further improve the classification accuracy. The experiments on document images taken from HIT and IMA databases show the effectiveness of our proposed method.

Original languageEnglish
Title of host publicationICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings
Pages828-831
Number of pages4
DOIs
StatePublished - 2012
Event2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012 - Shanghai, China
Duration: 16 Jul 201218 Jul 2012

Publication series

NameICALIP 2012 - 2012 International Conference on Audio, Language and Image Processing, Proceedings

Conference

Conference2012 3rd IEEE/IET International Conference on Audio, Language and Image Processing, ICALIP 2012
Country/TerritoryChina
CityShanghai
Period16/07/1218/07/12

Fingerprint

Dive into the research topics of 'Handwritten and machine printed text discrimination using an edge co-occurrence matrix'. Together they form a unique fingerprint.

Cite this