跳到主要导航 跳到搜索 跳到主要内容

Length-aware center loss for sequence to sequence Thai scene text recognition

  • Hongjian Zhan*
  • , Chun Li
  • , Bing Yin
  • , Yue Lu*
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Thai scene text recognition is a challenging task because Thai can be written in both horizontal and vertical directions, allowing characters to be stacked vertically. To address this issue, our previous work combined vertically stacked characters to create new characters. However, this strategy introduced many similar characters. In this paper, we further investigate this problem and propose the Length-aware Center Loss (LC) for Thai scene text recognition. The original center loss was designed for single object recognition tasks. When applied to multi-label tasks like text recognition, center loss is only effective when the lengths of the labels and prediction results are consistent. This can lead to an extreme case where all images receive incorrect predicted text lengths to minimize loss, severely interfering with the recognition process. Therefore, we propose the Length-aware Center Loss for text recognition. We also design the Length Supervision Module (LSM) and the Feature Clustering Module (FCM) to work alongside the LC loss. LSM predicts text length to provide additional supervision signals, while FCM aims to improve recognition performance by minimizing the distance between the features of corresponding class centers. Since there is no publicly available Thai scene text dataset, we have collected a new dataset containing more than 170,000 samples. Extensive experiments conducted on this dataset show that our method achieves superior performance in both string-level and character-level accuracy compared to other methods.

源语言英语
文章编号112182
期刊Engineering Applications of Artificial Intelligence
161
DOI
出版状态已出版 - 9 12月 2025
已对外发布

指纹

探究 'Length-aware center loss for sequence to sequence Thai scene text recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此