跳到主要导航 跳到搜索 跳到主要内容

Structured diversification emergence via reinforced organization control and hierarchical consensus learning

  • Wenhao Li
  • , Xiangfeng Wang*
  • , Bo Jin*
  • , Junjie Sheng
  • , Yun Hua
  • , Hongyuan Zha
  • *此作品的通讯作者
  • East China Normal University
  • The Chinese University of Hong Kong, Shenzhen

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

When solving a complex task, humans will spontaneously form teams and to complete different parts of the whole task, respectively. Meanwhile, the cooperation between teammates will improve efficiency. However, for current cooperative MARL methods, the cooperation team is constructed through either heuristics or end-to-end blackbox optimization. In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico based on reinforced organization control and hierarchical consensus learning. Rochico first learns an adaptive grouping policy through the organization control module, which is established by independent multi-agent reinforcement learning. Further, the hierarchical consensus module based on the hierarchical intentions with consensus constraint is introduced after team formation. Simultaneously, utilizing the hierarchical consensus module and a self-supervised intrinsic reward enhanced decision module, the proposed cooperative MARL algorithm Rochico can output the final diversified multi-agent cooperative policy. All three modules are organically combined to promote the structured diversification emergence. Comparative experiments on four large-scale cooperation tasks show that Rochico is significantly better than the current SOTA algorithms in terms of exploration efficiency and cooperation strength.

源语言英语
主期刊名20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021
出版商International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
773-781
页数9
ISBN(电子版)9781713832621
出版状态已出版 - 2021
活动20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021 - Virtual, Online
期限: 3 5月 20217 5月 2021

出版系列

姓名Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
2
ISSN(印刷版)1548-8403
ISSN(电子版)1558-2914

会议

会议20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021
Virtual, Online
时期3/05/217/05/21

指纹

探究 'Structured diversification emergence via reinforced organization control and hierarchical consensus learning' 的科研主题。它们共同构成独一无二的指纹。

引用此