跳到主要导航 跳到搜索 跳到主要内容

Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition[Formula presented]

  • Dajian Zhong
  • , Shujing Lyu
  • , Palaiahankote Shivakumara
  • , Umapada Pal
  • , Yue Lu*
  • *此作品的通讯作者
  • East China Normal University
  • University of Malaya
  • Indian Statistical Institute

科研成果: 期刊稿件文章同行评审

摘要

Unlike existing models that aim to address the challenge of scene text detection and recognition separately, the proposed work aims to address both text detection and recognition using a single architecture to deal with arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-Awareness-Attention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, bounding box regression branch for tight bounding boxes, and mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we consider the commonly used horizontal and multi-oriented natural scene text datasets, namely, ICDAR2013, ICDAR2015, and the arbitrarily shaped scene text datasets, namely, Total-Text and CTW1500 for experimentation. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https://codeocean.com/capsule/5666319/tree/v1.

源语言英语
文章编号117564
期刊Expert Systems with Applications
205
DOI
出版状态已出版 - 1 11月 2022

指纹

探究 'Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition[Formula presented]' 的科研主题。它们共同构成独一无二的指纹。

引用此