跳到主要导航 跳到搜索 跳到主要内容

Using multiple sequence alignment and statistical language model to integrate multiple Chinese address recognition outputs

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Different recognizers may result in different mistakes when they are used to recognize a Chinese address. In this paper, we present a method of combining multiple Chinese address recognition outputs to improve Chinese address recognition accuracy. The method first employs multiple sequence alignment to generate a lattice of candidate hypotheses from multiple different recognizer outputs and then applies statistical language model to choose the maximum likelihood candidate sequence. Taking the maximum as the final decision, the performance of our method is superior, compared to the single recognizers and Miyao's method. The experiments on the address images of real envelopes demonstrate that the proposed method increases the character recognition accuracy rate from 95.80% to 98.38%, with 61.30% error reduction. Furthermore, the corrected sorting rate of an automatic mail sorting system increases from 84.11% to 93.72%.

源语言英语
主期刊名13th IAPR International Conference on Document Analysis and Recognition, ICDAR 2015 - Conference Proceedings
出版商IEEE Computer Society
151-155
页数5
ISBN(电子版)9781479918058
DOI
出版状态已出版 - 20 11月 2015
活动13th International Conference on Document Analysis and Recognition, ICDAR 2015 - Nancy, 法国
期限: 23 8月 201526 8月 2015

出版系列

姓名Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
2015-November
ISSN(印刷版)1520-5363

会议

会议13th International Conference on Document Analysis and Recognition, ICDAR 2015
国家/地区法国
Nancy
时期23/08/1526/08/15

指纹

探究 'Using multiple sequence alignment and statistical language model to integrate multiple Chinese address recognition outputs' 的科研主题。它们共同构成独一无二的指纹。

引用此