Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks

  • Yunqi Zhang
  • Songda Li
  • Chunyuan Deng
  • Luyi Wang
  • Hui Zhao*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

Gender bias in vision-language models (VLMs) can reinforce harmful stereotypes and discrimination. In this paper, we focus on mitigating gender bias towards vision-language tasks. We identify object hallucination as the essence of gender bias in VLMs. Existing VLMs tend to focus on salient or familiar attributes in images but ignore contextualized nuances. Moreover, most VLMs rely on the co-occurrence between specific objects and gender attributes to infer the ignored features, ultimately resulting in gender bias. We propose GAMA, a task-agnostic generation framework to mitigate gender bias. GAMA consists of two stages: narrative generation and answer inference. During narrative generation, GAMA yields all-sided but gender-obfuscated narratives, which prevents premature concentration on localized image features, especially gender attributes. During answer inference, GAMA integrates the image, generated narrative, and a task-specific question prompt to infer answers for different vision-language tasks. This approach allows the model to rethink gender attributes and answers. We conduct extensive experiments on GAMA, demonstrating its debiasing and generalization ability.
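
The two stages named in the abstract can be pictured as a simple composition. The sketch below is illustrative only and is not the authors' implementation: the class `TwoStagePipeline`, the callables `generate_narrative` and `infer_answer`, and the stub functions are all hypothetical names, and the image is treated as opaque bytes. It only shows the data flow the abstract describes, where the second stage receives the image, the generated narrative, and a task-specific question prompt.

```python
from dataclasses import dataclass
from typing import Callable

# Assumption: an image is represented as opaque bytes in this sketch.
Image = bytes


@dataclass
class TwoStagePipeline:
    # Stage 1 (hypothetical interface): image -> gender-obfuscated narrative.
    generate_narrative: Callable[[Image], str]
    # Stage 2 (hypothetical interface): image, narrative, question -> answer.
    infer_answer: Callable[[Image, str, str], str]

    def run(self, image: Image, question: str) -> str:
        # First describe the whole scene, then answer with that description in hand.
        narrative = self.generate_narrative(image)
        return self.infer_answer(image, narrative, question)


# Stand-in stubs so the sketch runs end to end; real models would replace them.
def stub_narrative(image: Image) -> str:
    return "A person stands in a kitchen holding a pan."


def stub_answer(image: Image, narrative: str, question: str) -> str:
    return f"[answer to '{question}' given narrative: {narrative}]"


pipeline = TwoStagePipeline(stub_narrative, stub_answer)
result = pipeline.run(b"\x00", "What is the person doing?")
print(result)
```

Keeping the two stages as separate callables mirrors the task-agnostic framing: only the question prompt passed to the second stage would change between tasks.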

Original language: English
Title of host publication: Long Papers
Editors: Kevin Duh, Helena Gomez, Steven Bethard
Publisher: Association for Computational Linguistics (ACL)
Pages: 773-791
Number of pages: 19
ISBN (electronic): 9798891761148
DOI
Publication status: Published - 2024
Event: 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 - Hybrid, Mexico City, Mexico
Duration: 16 Jun 2024 to 21 Jun 2024

Publication series

Name: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Volume: 1

Conference

Conference: 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Country/Territory: Mexico
City: Hybrid, Mexico City
Period: 16/06/24 to 21/06/24
