跳到主要导航 跳到搜索 跳到主要内容

Enhancing Out-of-Distribution Generalization in VQA through Gini Impurity-guided Adaptive Margin Loss

  • Shuwen Yang
  • , Tianyu Huai
  • , Anran Wu
  • , Xingjiao Wu
  • , Wenxin Hu*
  • , Liang He*
  • *此作品的通讯作者
  • East China Normal University
  • Fudan University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In the Visual Question Answering (VQA) task context, most methods are influenced by language bias, resulting in poor performance on out-of-distribution data. Recently, some works attempted to use the adaptive margin loss to address this bias issue. However, these works typically consider only the frequency of answer labels when designing margin loss, leading to some samples being overly emphasized or lacking sufficient attention during model training. To address this issue, we propose a novel margin loss guided by the Gini-impurity for VQA debiasing. By comprehensively considering label distribution and instance complexity, we use Gini impurity to adjust the margin values in margin loss, balancing the attention of the model to different samples. Importantly, our method is plug-and-play and can be directly applied to any baseline. In the VQA-CP v2 task, our evaluation results across various baselines surpass the current state-of-the-art methods.

源语言英语
主期刊名2024 IEEE International Conference on Multimedia and Expo, ICME 2024
出版商IEEE Computer Society
ISBN(电子版)9798350390155
DOI
出版状态已出版 - 2024
活动2024 IEEE International Conference on Multimedia and Expo, ICME 2024 - Niagra Falls, 加拿大
期限: 15 7月 202419 7月 2024

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2024 IEEE International Conference on Multimedia and Expo, ICME 2024
国家/地区加拿大
Niagra Falls
时期15/07/2419/07/24

指纹

探究 'Enhancing Out-of-Distribution Generalization in VQA through Gini Impurity-guided Adaptive Margin Loss' 的科研主题。它们共同构成独一无二的指纹。

引用此