跳到主要导航 跳到搜索 跳到主要内容

MindScope: Exploring Cognitive Biases in Large Language Models Through Multi-Agent Systems

  • Zhentao Xie
  • , Jiabao Zhao*
  • , Yilei Wang
  • , Jinxin Shi
  • , Yanhong Bai
  • , Xingjiao Wu
  • , Liang He
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Detecting cognitive biases in large language models (LLMs) is a fascinating task that aims to probe the existing cognitive biases within these models.Current methods for detecting cognitive biases in language models generally suffer from incomplete detection capabilities and a restricted range of detectable bias types.To address this issue, we introduced the’MindScope’ dataset, which distinctively integrates static and dynamic elements.The static component comprises 5,170 open-ended questions spanning 72 cognitive bias categories.The dynamic component leverages a rule-based, multi-agent communication framework to facilitate the generation of multi-round dialogues.This framework is flexible and readily adaptable for various psychological experiments involving LLMs.In addition, we introduce a multi-agent detection method applicable to a wide range of detection tasks, which integrates Retrieval-Augmented Generation (RAG), competitive debate, and a reinforcement learning-based decision module.Demonstrating substantial effectiveness, this method has shown to improve detection accuracy by as much as 35.10% compared to GPT-4.Codes and appendix are available at https://github.com/2279072142/MindScope.

源语言英语
主期刊名ECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings
编辑Ulle Endriss, Francisco S. Melo, Kerstin Bach, Alberto Bugarin-Diz, Jose M. Alonso-Moral, Senen Barro, Fredrik Heintz
出版商IOS Press BV
3308-3315
页数8
ISBN(电子版)9781643685489
DOI
出版状态已出版 - 16 10月 2024
活动27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, 西班牙
期限: 19 10月 202424 10月 2024

出版系列

姓名Frontiers in Artificial Intelligence and Applications
392
ISSN(印刷版)0922-6389
ISSN(电子版)1879-8314

会议

会议27th European Conference on Artificial Intelligence, ECAI 2024
国家/地区西班牙
Santiago de Compostela
时期19/10/2424/10/24

指纹

探究 'MindScope: Exploring Cognitive Biases in Large Language Models Through Multi-Agent Systems' 的科研主题。它们共同构成独一无二的指纹。

引用此