Learning Roles with Emergent Social Value Orientations

Research output: Contribution to journalArticlepeer-review

Abstract

Social dilemmas can be considered situations where individual rationality leads to collective irrationality. The multi-agent reinforcement learning community has leveraged ideas from social science, such as social value orientations (SVO), to solve social dilemmas in complex cooperative tasks. In this paper, we first introduce the typical “division of labor or roles” mechanism in human society, and provide a promising solution for intertemporal social dilemmas (ISD) with SVOs. A novel learning framework, called Learning Roles with Emergent SVOs (RESVO), is proposed to transform the learning of roles into the social value orientation emergence, which is symmetrically solved by endowing agents with altruism to share rewards with other agents. An SVO-based role embedding space is then constructed by individual conditioning policies on roles with a novel rank regularizer and mutual information maximizer. Experiments show that RESVO achieves a stable division of labor and cooperation in ISDs with different complexity.

Keywords

  • Division of Labor
  • Multi-Agent Reinforcement Learning
  • Social Dilemma
  • Social Value Orientation

Fingerprint

Dive into the research topics of 'Learning Roles with Emergent Social Value Orientations'. Together they form a unique fingerprint.

Cite this