跳到主要导航 跳到搜索 跳到主要内容

DOCUMENT LAYOUT ANALYSIS VIA DYNAMIC RESIDUAL FEATURE FUSION

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The document layout analysis (DLA) aims to split the document image into different interest regions and understand the role of each region, which has wide application such as optical character recognition (OCR) systems and document retrieval. However, it is a challenge to build a DLA system because the training data is very limited and lacks an efficient model. In this paper, we propose an end-to-end united network named Dynamic Residual Fusion Network (DRFN) for the DLA task. Specifically, we design a dynamic residual feature fusion module which can fully utilize low-dimensional information and maintain high-dimensional category information. Besides, to deal with the model overfitting problem that is caused by lacking enough data, we propose the dynamic select mechanism for efficient fine-tuning in limited train data. We experiment with two challenging datasets and demonstrate the effectiveness of the proposed module.

源语言英语
主期刊名2021 IEEE International Conference on Multimedia and Expo, ICME 2021
出版商IEEE Computer Society
ISBN(电子版)9781665438643
DOI
出版状态已出版 - 2021
活动2021 IEEE International Conference on Multimedia and Expo, ICME 2021 - Shenzhen, 中国
期限: 5 7月 20219 7月 2021

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2021 IEEE International Conference on Multimedia and Expo, ICME 2021
国家/地区中国
Shenzhen
时期5/07/219/07/21

指纹

探究 'DOCUMENT LAYOUT ANALYSIS VIA DYNAMIC RESIDUAL FEATURE FUSION' 的科研主题。它们共同构成独一无二的指纹。

引用此