The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability
arxiv(2024)
摘要
Interpretability techniques are valuable for helping humans understand and
oversee AI systems. The SaTML 2024 CNN Interpretability Competition solicited
novel methods for studying convolutional neural networks (CNNs) at the ImageNet
scale. The objective of the competition was to help human crowd-workers
identify trojans in CNNs. This report showcases the methods and results of four
featured competition entries. It remains challenging to help humans reliably
diagnose trojans via interpretability tools. However, the competition's entries
have contributed new techniques and set a new record on the benchmark from
Casper et al., 2023.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要