CNN-based Hindi numeral string recognition for Indian postal automation

  • Hongjian Zhan
  • , Shujing Lyu
  • , Umapada Pal
  • , Yue Lu*
  • *Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

4 Scopus citations

Abstract

Digits/numerals in the Indian pin-code of handwritten postal documents may touch each other and hence digit string recognition is a very challenging task. In this paper, we propose a digit string recognition system for Indian postal documents written in Hindi. Unlike normal text string, in a string of digits there is no contextual information among the digits as a digit may be followed by an arbitrary digit in a string of digits. Because of this, here we propose a new architecture which is based on CNN (Convolutional Neural Network) and CTC (Connectionist Temporal Classification), without using RNN for Hindi numeral string recognition. Also to connect CNN with CTC, we transform the outputs of CNN to a two-dimension vector to meet the feeding requirement of CTC. Furthermore, we utilize dense blocks to build CNN part to extract efficient image features. Comparative studies with the state-of-the-art methods show that the proposed method outperforms the other existing methods on Hindi numeral string recognition.

Original languageEnglish
Pages77-82
Number of pages6
DOIs
StatePublished - 2019
Event2nd International Workshop on Machine Learning, WML 2019 - ICDAR 2019 Workshop - Sydney, Australia
Duration: 21 Sep 201922 Sep 2019

Conference

Conference2nd International Workshop on Machine Learning, WML 2019 - ICDAR 2019 Workshop
Country/TerritoryAustralia
CitySydney
Period21/09/1922/09/19

Keywords

  • Connectionist temporal classification
  • Convolutional neural network
  • Hindi numeral string
  • Postal automation

Fingerprint

Dive into the research topics of 'CNN-based Hindi numeral string recognition for Indian postal automation'. Together they form a unique fingerprint.

Cite this