A Rate-Distortion Framework for Explaining Black-Box Model Decisions.

International Conference on Machine Learning(2021)

引用 13|浏览20
暂无评分
摘要
We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model such as neural networks. Our experiments demonstrate the framework’s adaptability to diverse data modalities, particularly images, audio, and physical simulations of urban environments.
更多
查看译文
关键词
model,rate-distortion,black-box
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要