MT-YOLOv5: Mobile terminal table detection model based on YOLOv5

  • Zixin Ning
  • , Xinjiao Wu
  • , Jing Yang*
  • , Yanqin Yang
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

7 Scopus citations

Abstract

Table detection is an important task of optical character recognition(OCR). At present, table detection for desktop applications has basically reached commercial requirements. With the advancement of informatization, personal demand for table detection has gradually increased. There is an urgent need to establish a table detection method that can be deployed on handheld devices. This paper proposes a mobile terminal table detection model based on YOLOv5. First, we used YOLOv5 as the main framework of the model. However, considering the problem of connection redundancy in the backbone of YOLOv5, on the basis of retaining the YOLOv5 multi-scale detection head, we replaced the backbone of YOLOv5 with the same excellent Mobilenetv2. In addition, considering the non-linear defects of the lightweight model, we use deformable convolution to make up for it. This paper has been evaluated on the ICDAR 2019 dataset, and the results show that compared with the baseline model, the model reduces the number of parameters by half and increases the detection speed by 47%. At the same time, the model can reach 35.25 FPS on ordinary Android phones.

Original languageEnglish
Article number012010
JournalJournal of Physics: Conference Series
Volume1978
Issue number1
DOIs
StatePublished - 27 Jul 2021
Event4th International Conference on Physics, Mathematics and Statistics, ICPMS 2021 - Kunming, China
Duration: 19 May 202121 May 2021

Fingerprint

Dive into the research topics of 'MT-YOLOv5: Mobile terminal table detection model based on YOLOv5'. Together they form a unique fingerprint.

Cite this