The Mirror Agent Model: A Bayesian Architecture for Interpretable Agent Behavior

Explainable and Transparent AI and Multi-Agent Systems (2022)

Abstract

In this paper we present a novel architecture for generating interpretable behavior and explanations. We call this architecture the Mirror Agent Model because it defines the observer model, which is the target of explicit and implicit communications, as a mirror of the agent's own model. To give a general understanding of this work, we first review prior relevant results on the informative communication of agents' intentions and the production of legible behavior. In the second part of the paper, we extend the architecture with novel explanation capabilities based on off-the-shelf saliency methods and report preliminary qualitative results.

Keywords

Interpretability, Explainability, Bayesian networks, Mirror Agent Model
