Handwritten Digit String Recognition for Indian Scripts

Hongjian Zhan*, Pinaki Nath Chowdhury, Umapada Pal, Yue Lu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In many documents digits/numerals may touch each other and hence digit string recognition is necessary as segmentation of individual numeral from the touching string is difficult. In this paper, we propose a digit string recognition system for four Indian popular scripts. Here we consider strings of Kannada, Oriya, Tamil and Telugu scripts for our experiment. This paper has two contributions: (i) we have developed 4 datasets of digit string for each of these four scripts. Each dataset has 20000 numeral string samples for training and 30000 samples for testing. As there is no such dataset available, it will be helpful to the community (ii) we apply a RNN free CNN (Convolutional Neural Network) and CTC (Connectionist Temporal Classifica-tion) based architecture for numeral string recognition. Unlike normal text string, in string of digits has no contextual information among the digits and hence a digit may be followed by an arbitrary digit in a digit string. Because of such behaviors we apply a CNN and CTC based architecture without RNN for numeral string recognition. We tested our scheme on our different test datasets and results are provided.

Original languageEnglish
Title of host publicationPattern Recognition - 5th Asian Conference, ACPR 2019, Revised Selected Papers
EditorsShivakumara Palaiahnakote, Gabriella Sanniti di Baja, Liang Wang, Wei Qi Yan
PublisherSpringer
Pages262-273
Number of pages12
ISBN (Print)9783030412982
DOIs
StatePublished - 2020
Event5th Asian Conference on Pattern Recognition, ACPR 2019 - Auckland, New Zealand
Duration: 26 Nov 201929 Nov 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12047 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th Asian Conference on Pattern Recognition, ACPR 2019
Country/TerritoryNew Zealand
CityAuckland
Period26/11/1929/11/19

Keywords

  • Connectionist Temporal Classification
  • Convolutional Neural Network
  • Postal Automation
  • String recognition

Fingerprint

Dive into the research topics of 'Handwritten Digit String Recognition for Indian Scripts'. Together they form a unique fingerprint.

Cite this