TY - JOUR
T1 - Recognition of handwritten Chinese address with writing variations
AU - Wei, Xiaohua
AU - Lu, Shujing
AU - Wen, Ying
AU - Lu, Yue
N1 - Publisher Copyright: © 2016 Elsevier B.V. All rights reserved.
PY - 2016/4/1
Y1 - 2016/4/1
N2 - Handwritten Chinese address recognition is a challenging task, not only because of the large number of Chinese characters and the unconstrained nature of handwriting, but also because of irregularities across address formats. Existing techniques generally address the problem by transforming the address database into a large-scale character-level-tree (CLT) and then matching the nodes of the generated CLT against the candidate patterns. However, the CLT cannot cover all variations of address formats. This paper proposes a more compact tree, built at the word level, that covers as many variations of address formats as completely as possible. Specifically, the segmented candidate patterns are first recognized by a character classifier and then mapped to candidate address words by matching against the proposed word-level-tree (WLT) address database. Finally, the address recognition result is obtained in the path-matching phase by summing the scores of the candidate address words along each matching path. The proposed scheme was tested on real mail address images captured by an automatic letter-sorting machine. Experimental results demonstrate that the proposed WLT-based method outperforms the four benchmark methods.
AB - Handwritten Chinese address recognition is a challenging task, not only because of the large number of Chinese characters and the unconstrained nature of handwriting, but also because of irregularities across address formats. Existing techniques generally address the problem by transforming the address database into a large-scale character-level-tree (CLT) and then matching the nodes of the generated CLT against the candidate patterns. However, the CLT cannot cover all variations of address formats. This paper proposes a more compact tree, built at the word level, that covers as many variations of address formats as completely as possible. Specifically, the segmented candidate patterns are first recognized by a character classifier and then mapped to candidate address words by matching against the proposed word-level-tree (WLT) address database. Finally, the address recognition result is obtained in the path-matching phase by summing the scores of the candidate address words along each matching path. The proposed scheme was tested on real mail address images captured by an automatic letter-sorting machine. Experimental results demonstrate that the proposed WLT-based method outperforms the four benchmark methods.
KW - Candidate address word
KW - Character classification
KW - Handwritten Chinese address recognition
KW - Word-level-tree
KW - Writing variations
UR - https://www.scopus.com/pages/publications/84958184398
U2 - 10.1016/j.patrec.2015.12.018
DO - 10.1016/j.patrec.2015.12.018
M3 - Article
AN - SCOPUS:84958184398
SN - 0167-8655
VL - 73
SP - 68
EP - 75
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -