TY - JOUR
T1 - Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition[Formula presented]
AU - Zhong, Dajian
AU - Lyu, Shujing
AU - Shivakumara, Palaiahankote
AU - Pal, Umapada
AU - Lu, Yue
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/11/1
Y1 - 2022/11/1
N2 - Unlike existing models that aim to address the challenge of scene text detection and recognition separately, the proposed work aims to address both text detection and recognition using a single architecture to deal with arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-Awareness-Attention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, bounding box regression branch for tight bounding boxes, and mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we consider the commonly used horizontal and multi-oriented natural scene text datasets, namely, ICDAR2013, ICDAR2015, and the arbitrarily shaped scene text datasets, namely, Total-Text and CTW1500 for experimentation. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https://codeocean.com/capsule/5666319/tree/v1.
AB - Unlike existing models that aim to address the challenge of scene text detection and recognition separately, the proposed work aims to address both text detection and recognition using a single architecture to deal with arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-Awareness-Attention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, bounding box regression branch for tight bounding boxes, and mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we consider the commonly used horizontal and multi-oriented natural scene text datasets, namely, ICDAR2013, ICDAR2015, and the arbitrarily shaped scene text datasets, namely, Total-Text and CTW1500 for experimentation. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https://codeocean.com/capsule/5666319/tree/v1.
KW - Attention model
KW - Location-awareness-attention model
KW - Scene text detection
KW - Scene text recognition
KW - Text proposal
UR - https://www.scopus.com/pages/publications/85131256478
U2 - 10.1016/j.eswa.2022.117564
DO - 10.1016/j.eswa.2022.117564
M3 - 文章
AN - SCOPUS:85131256478
SN - 0957-4174
VL - 205
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 117564
ER -