跳到主要导航 跳到搜索 跳到主要内容

Image Layer Modeling for Complex Document Layout Generation

  • Tianlong Ma
  • , Xingjiao Wu
  • , Xiangcheng Du
  • , Yanlong Wang
  • , Cheng Jin*
  • *此作品的通讯作者
  • Fudan University
  • Shanxi University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Document layout analysis (DLA) plays an essential role in information extraction and document understanding. At present, DLA has reached the milestone achievement; however, DLA of non-Manhattan is still challenging because of annotation data limitations. In this paper, we propose an image layer modeling method to mitigate this issue. The image layer modeling method generates document images of non-Manhattan layouts by superimposing images under pre-defined aesthetic rules. Due to the lack of evaluation benchmark for non-Manhattan layout, we have constructed a manually-labeled non-Manhattan layout fine-grained segmentation dataset. To the best of our knowledge, this is the first manually-labeled non-Manhattan layout fine-grained segmentation dataset. Extensive experimental results verify that our proposed image layer modeling method can better deal with the fine-grained segmented document of the non-Manhattan layout.

源语言英语
主期刊名Proceedings - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023
出版商IEEE Computer Society
2261-2266
页数6
ISBN(电子版)9781665468916
DOI
出版状态已出版 - 2023
已对外发布
活动2023 IEEE International Conference on Multimedia and Expo, ICME 2023 - Brisbane, 澳大利亚
期限: 10 7月 202314 7月 2023

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
2023-July
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2023 IEEE International Conference on Multimedia and Expo, ICME 2023
国家/地区澳大利亚
Brisbane
时期10/07/2314/07/23

指纹

探究 'Image Layer Modeling for Complex Document Layout Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此