Bangla/English script identification based on analysis of connected component profiles

  • Lijun Zhou*
  • , Yue Lu
  • , Chew Lim Tan
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

64 Scopus citations

Abstract

Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with applications to the destination address block of Bangladesh envelope images. The proposed approach is based upon the analysis of connected component profiles extracted from the destination address block images, however, it does not place any emphasis on the information provided by individual characters themselves and does not require any character/line segmentation. Experimental results demonstrate that the proposed technique is capable of identifying Bangla/English scripts on the real Bangladesh postal images.

Original languageEnglish
Title of host publicationDocument Analysis Systems VII - 7th International Workshop, DAS 2006, Proceedings
PublisherSpringer Verlag
Pages243-254
Number of pages12
ISBN (Print)3540321403, 9783540321408
DOIs
StatePublished - 2006
Event7th International Workshop on Document Analysis Systems, DAS 2006 - Nelson, New Zealand
Duration: 13 Feb 200615 Feb 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3872 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Workshop on Document Analysis Systems, DAS 2006
Country/TerritoryNew Zealand
CityNelson
Period13/02/0615/02/06

Fingerprint

Dive into the research topics of 'Bangla/English script identification based on analysis of connected component profiles'. Together they form a unique fingerprint.

Cite this