跳到主要导航 跳到搜索 跳到主要内容

Low-Redundancy Knowledge Generation and Modality-Aware Interaction for Multimodal Information Extraction in Social Media

  • Shizhou Huang
  • , Bo Xu
  • , Changqun Li
  • , Yang Yu
  • , Xin Lin*
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Multimodal information extraction (MIE) has gained increasing attention, as it helps to accomplish information extraction by adding images as auxiliary information. By acquiring entity-related knowledge, knowledge generation methods can effectively enhance the performance of information extraction models. However, current knowledge generation methods have two weaknesses: (1) they often generate knowledge that includes task-irrelevant information causing redundancy and negatively impacting model performance; (2) they typically concatenate knowledge and text input directly together, ignoring the stylistic and contextual differences arising from their different sources. To address these issues, we propose Low-Redundancy Knowledge Generation and Modality-Aware Interaction (LRKG-MAI). Our approach leverages a large language model to generate task-relevant knowledge with minimal redundancy, while treating knowledge as a distinct modality that interacts with text within its own representation space. Extensive experiments demonstrate the effectiveness of our approach. The source code can be found at https://github.com/JinFish/LRKG-MAI.

源语言英语
主期刊名2025 IEEE International Conference on Multimedia and Expo
主期刊副标题Journey to the Center of Machine Imagination, ICME 2025 - Conference Proceedings
出版商IEEE Computer Society
ISBN(电子版)9798331594954
DOI
出版状态已出版 - 2025
活动2025 IEEE International Conference on Multimedia and Expo, ICME 2025 - Nantes, 法国
期限: 30 6月 20254 7月 2025

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2025 IEEE International Conference on Multimedia and Expo, ICME 2025
国家/地区法国
Nantes
时期30/06/254/07/25

指纹

探究 'Low-Redundancy Knowledge Generation and Modality-Aware Interaction for Multimodal Information Extraction in Social Media' 的科研主题。它们共同构成独一无二的指纹。

引用此