How to Explain Neural Networks: an Approximation Perspective

Hangcheng Dong,Bingguo Liu,Fengdong Chen,Dong Ye,Guodong Liu

semanticscholar（2021）

引用 0|浏览1

暂无评分

摘要

The lack of interpretability has hindered the large-scale adoption of AI technologies. However, the fundamental idea of interpretability, as well as how to put it into practice, remains unclear. We provide notions of interpretability based on approximation theory in this study. We first implement this approximation interpretation on a specific model (fully connected neural network) and then propose to use MLP as a universal interpreter to explain arbitrary black-box models. Extensive experiments demonstrate the effectiveness of our approach.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要