Symbol Location-Aware Network for Improving Handwritten Mathematical Expression Recognition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Recently most handwritten mathematical expression recognition methods adopt the attention-based encoder-decoder framework, which generates LaTeX sequences from given images. However, the accuracy of the attention mechanism limits the performance of HMER models. Lacking global context information in the decoding process is also a challenge for HMER. Some methods adopt symbol-level counting to localize symbols for improving the model performance, while these methods cannot work well. In this paper, we propose a method named SLAN, shorted for a Symbol Location-Aware Network, to solve the HMER problem. Specifically, we propose an advanced relation-level counting method to detect symbols in the image. We solve the lacking global context problem with a new global context-aware decoder. For improving the accuracy of attention, we design a novel attention alignment loss function by the dynamic programming algorithm, which can learn attention alignment directly without pixel-level labels. We conducted extensive experiments on the CROHME dataset to demonstrate the effectiveness of each part of SLAN and achieved state-of-the-art performance.

Original languageEnglish
Title of host publicationICMR 2023 - Proceedings of the 2023 ACM International Conference on Multimedia Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages516-524
Number of pages9
ISBN (Electronic)9798400701788
DOIs
StatePublished - 12 Jun 2023
Event2023 ACM International Conference on Multimedia Retrieval, ICMR 2023 - Thessaloniki, Greece
Duration: 12 Jun 202315 Jun 2023

Publication series

NameICMR 2023 - Proceedings of the 2023 ACM International Conference on Multimedia Retrieval

Conference

Conference2023 ACM International Conference on Multimedia Retrieval, ICMR 2023
Country/TerritoryGreece
CityThessaloniki
Period12/06/2315/06/23

Keywords

  • dynamic programming
  • global context
  • handwritten mathematical expression recognition
  • symbol counting

Fingerprint

Dive into the research topics of 'Symbol Location-Aware Network for Improving Handwritten Mathematical Expression Recognition'. Together they form a unique fingerprint.

Cite this