跳到主要导航 跳到搜索 跳到主要内容

Handwriting and Hand-Sketched Graphics Detection Using Convolutional Neural Networks

  • Song Yang Cheng
  • , Yu Jie Xiong*
  • , Jun Qing Zhang
  • , Yan Chun Cao
  • *此作品的通讯作者
  • Shanghai University of Engineering Science

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Handwriting and hand-sketched graphics carry rich information to reveal the insights of the physical and emotional state of the writer. Before analyzing the personal traits of handwriting and hand-sketched graphics, detecting them from the image is the most immediate subproblem in handwritten document analysis and understanding. In this paper, we introduce two Convolutional Neural Networks (CNN) based methods to extract multimodal information (handwriting and hand-sketched graphics) from questionnaire documents. A Connectionist Text Proposal Network (CTPN) based method is proposed to detect handwriting. The first stage employs the VGG-16 model to generate the convolutional feature maps of the original images. Then the second stage adopts a BLSTM based detector to predict the scores of candidate zones. An instance segmentation method using the Mask Region Convolutional Neural Network (Mask-RCNN) is also proposed to solve hand-sketched graphics detection problem. The Mask-RCNN based approach has two parts: the backbone and the head. The backbone is to extract the features over the original image, and the head is to perform bounding boxes regression and mask prediction. At first, a simple Region Proposal Network (RPN) is adopted to generate the proposals of hand-sketched graphics efficiently. Then, the Region of Interest (RoI) features of the above proposals are fed into the Fast-RCNN branch and the mask branch to obtain the bounding boxes and the graphics segmentation results. The best handwriting detection performance of 200 test questionnaire images is that the precision rate is 99.5%, the recall rate is 99.2% and the F-measure score is 99.4%. The best detection performance of hand-sketched graphics is that the precision rate is 99.0%, the recall rate is 99.5% and the F-measure score is 99.3%. Experiments demonstrate that the proposed methods achieve promising results in both handwriting and hand-sketched graphics detection tasks.

源语言英语
主期刊名Pattern Recognition and Artificial Intelligence - International Conference, ICPRAI 2020, Proceedings
编辑Yue Lu, Nicole Vincent, Pong Chi Yuen, Wei-Shi Zheng, Farida Cheriet, Ching Y. Suen
出版商Springer Science and Business Media Deutschland GmbH
352-362
页数11
ISBN(印刷版)9783030598297
DOI
出版状态已出版 - 2020
活动2nd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2020 - Zhongshan, 中国
期限: 19 10月 202023 10月 2020

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12068 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议2nd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2020
国家/地区中国
Zhongshan
时期19/10/2023/10/20

指纹

探究 'Handwriting and Hand-Sketched Graphics Detection Using Convolutional Neural Networks' 的科研主题。它们共同构成独一无二的指纹。

引用此