A New Deep Fuzzy Based MSER Model for Multiple Document Images Classification

Kunal Biswas, Palaiahnakote Shivakumara, Sittravell Sivanthi, Umapada Pal, Yue Lu, Cheng Lin Liu, Mohamad Nizam Bin Ayub

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Understanding document images uploaded on social media is challenging because of multiple types like handwritten, printed and scene text images. This study presents a new model called Deep Fuzzy based MSER for classification of multiple document images (like handwritten, printed and scene text). The proposed model detects candidate components that represent dominant information irrespective of the type of document images by combining fuzzy and MSER in a novel way. For every candidate component, the proposed model extracts distance-based features which result in proximity matrix (feature matrix). Further, the deep learning model is proposed for classification by feeding input images and feature matrix as input. To evaluate the proposed model, we create our own dataset and to show effectiveness, the proposed model is tested on standard datasets. The results show that the proposed approach outperforms the existing methods in terms of average classification rate.

Original languageEnglish
Title of host publicationPattern Recognition and Artificial Intelligence - 3rd International Conference, ICPRAI 2022, Proceedings
EditorsMounîm El Yacoubi, Eric Granger, Pong Chi Yuen, Umapada Pal, Nicole Vincent
PublisherSpringer Science and Business Media Deutschland GmbH
Pages358-370
Number of pages13
ISBN (Print)9783031090363
DOIs
StatePublished - 2022
Event3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022 - Paris, France
Duration: 1 Jun 20223 Jun 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13363 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022
Country/TerritoryFrance
CityParis
Period1/06/223/06/22

Keywords

  • Document classification
  • Document image analysis
  • Document image understanding
  • Handwritten documents understanding
  • Scene text recognition

Fingerprint

Dive into the research topics of 'A New Deep Fuzzy Based MSER Model for Multiple Document Images Classification'. Together they form a unique fingerprint.

Cite this