D3C: Reducing the Price of Anarchy in Multi-Agent Learning.

Ian Gemp, Kevin R. McKee, Richard Everett, Edgar A. Duéñez-Guzmán, Yoram Bachrach, David Balduzzi, Andrea Tacchetti

International Joint Conference on Autonomous Agents and Multi-agent Systems (2022)

Abstract

Even in simple multi-agent systems, fixed incentives can lead to outcomes that are poor for the group and each individual agent. We propose a method, D3C, for online adjustment of agent incentives that reduces the loss incurred at a Nash equilibrium. Agents adjust their incentives by learning to mix their incentive with that of other agents, until a compromise is reached in a distributed fashion. We show that D3C improves outcomes for each agent and the group as a whole on several social dilemmas including a traffic network with Braess's paradox, a prisoner's dilemma, and several reinforcement learning domains.
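
The idea of mixing one agent's incentive with another's can be illustrated on a prisoner's dilemma. The sketch below is not the D3C algorithm: it uses a fixed, hand-picked mixing weight `w` rather than one learned online, and the loss values, the names `LOSS_1`, `LOSS_2`, `mixed_losses`, `pure_nash`, and `total_loss` are all assumptions made for illustration. It only shows that once each agent places enough weight on the other's loss, the pure Nash equilibrium can move from mutual defection to mutual cooperation, lowering the group's total loss.

```python
from itertools import product

# Hypothetical prisoner's dilemma losses (lower is better).
# Actions: 0 = cooperate, 1 = defect. LOSS_1[a1][a2] is player 1's loss.
LOSS_1 = [[1.0, 3.0],
          [0.0, 2.0]]
# Symmetric game: player 2's loss is player 1's loss with the roles swapped.
LOSS_2 = [[LOSS_1[a2][a1] for a2 in range(2)] for a1 in range(2)]


def mixed_losses(w):
    """Return both players' losses after mixing: (1 - w) * own + w * other's."""
    m1 = [[(1 - w) * LOSS_1[a1][a2] + w * LOSS_2[a1][a2] for a2 in range(2)]
          for a1 in range(2)]
    m2 = [[(1 - w) * LOSS_2[a1][a2] + w * LOSS_1[a1][a2] for a2 in range(2)]
          for a1 in range(2)]
    return m1, m2


def pure_nash(l1, l2):
    """Enumerate pure-strategy Nash equilibria of a 2x2 game given as losses."""
    equilibria = []
    for a1, a2 in product(range(2), repeat=2):
        # Neither player can lower their own loss by deviating unilaterally.
        best1 = all(l1[a1][a2] <= l1[b][a2] for b in range(2))
        best2 = all(l2[a1][a2] <= l2[a1][b] for b in range(2))
        if best1 and best2:
            equilibria.append((a1, a2))
    return equilibria


def total_loss(a1, a2):
    """Group loss measured with the original, unmixed losses."""
    return LOSS_1[a1][a2] + LOSS_2[a1][a2]


if __name__ == "__main__":
    for w in (0.0, 0.5):
        for a1, a2 in pure_nash(*mixed_losses(w)):
            print(f"w={w}: equilibrium {(a1, a2)}, total loss {total_loss(a1, a2)}")
```

With these particular numbers, cooperation becomes the dominant action for both players once `w` exceeds 1/3. In the paper's setting the mixing is adjusted online and in a distributed fashion, which this fixed-weight sketch does not attempt to reproduce.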
