跳到主要导航 跳到搜索 跳到主要内容

MT-YOLOv5: Mobile terminal table detection model based on YOLOv5

  • Zixin Ning
  • , Xinjiao Wu
  • , Jing Yang*
  • , Yanqin Yang
  • *此作品的通讯作者
  • East China Normal University

科研成果: 期刊稿件会议文章同行评审

摘要

Table detection is an important task of optical character recognition(OCR). At present, table detection for desktop applications has basically reached commercial requirements. With the advancement of informatization, personal demand for table detection has gradually increased. There is an urgent need to establish a table detection method that can be deployed on handheld devices. This paper proposes a mobile terminal table detection model based on YOLOv5. First, we used YOLOv5 as the main framework of the model. However, considering the problem of connection redundancy in the backbone of YOLOv5, on the basis of retaining the YOLOv5 multi-scale detection head, we replaced the backbone of YOLOv5 with the same excellent Mobilenetv2. In addition, considering the non-linear defects of the lightweight model, we use deformable convolution to make up for it. This paper has been evaluated on the ICDAR 2019 dataset, and the results show that compared with the baseline model, the model reduces the number of parameters by half and increases the detection speed by 47%. At the same time, the model can reach 35.25 FPS on ordinary Android phones.

源语言英语
文章编号012010
期刊Journal of Physics: Conference Series
1978
1
DOI
出版状态已出版 - 27 7月 2021
活动4th International Conference on Physics, Mathematics and Statistics, ICPMS 2021 - Kunming, 中国
期限: 19 5月 202121 5月 2021

指纹

探究 'MT-YOLOv5: Mobile terminal table detection model based on YOLOv5' 的科研主题。它们共同构成独一无二的指纹。

引用此