跳到主要导航 跳到搜索 跳到主要内容

DeepText: Detecting text from the wild with multi-ASPP-assembled deeplab

  • Qingqing Wang
  • , Wenjing Jia
  • , Xiangjian He
  • , Yue Lu
  • , Michael Blumenstein
  • , Ye Huang
  • , Shujing Lyu

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this paper, we address the issue of scene text detection in the way of direct regression and successfully adapt an effective semantic segmentation model, DeepLab v3+ [1], for this application. In order to handle texts with arbitrary orientations and sizes and improve the recall of small texts, we propose to extract features of multiple scales by inserting multiple Atrous Spatial Pyramid Pooling (ASPP) layers to the DeepLab after the feature maps with different resolutions. Then, we set multiple auxiliary IoU losses at the decoding stage and make auxiliary connections from the intermediate encoding layers to the decoder to assist network training and enhance the discrimination ability of lower encoding layers. Experiments conducted on the benchmark scene text dataset ICDAR2015 demonstrate the superior performance of our proposed network, named as DeepText, over the state-of-the-art approaches.

源语言英语
主期刊名Proceedings - 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
出版商IEEE Computer Society
208-213
页数6
ISBN(电子版)9781728128610
DOI
出版状态已出版 - 9月 2019
活动15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019 - Sydney, 澳大利亚
期限: 20 9月 201925 9月 2019

出版系列

姓名Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN(印刷版)1520-5363

会议

会议15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019
国家/地区澳大利亚
Sydney
时期20/09/1925/09/19

指纹

探究 'DeepText: Detecting text from the wild with multi-ASPP-assembled deeplab' 的科研主题。它们共同构成独一无二的指纹。

引用此