RL-CAM: Visual Explanations for Convolutional Networks using Reinforcement Learning.

Soumyendu Sarkar,Ashwin Ramesh Babu,Sajad Mousavi,Sahand Ghorbanpour,Vineet Gundecha,Antonio Guillen, Ricardo Luna Gutierrez,Avisek Naug

CVPR Workshops（2023）

引用 4|浏览5

暂无评分

摘要

Convolutional Neural Networks (CNNs) are state-of-the-art models for computer vision tasks such as image classification, object detection, and segmentation. However, these models suffer from their inability to explain decisions, particularly in fields like healthcare and security, where interpretability is critical. Previous research has developed various methods for interpreting CNNs, including visualization-based approaches (e.g., saliency maps) that aim to reveal the underlying features used by the model to make predictions. In this work, we propose a novel approach that uses reinforcement learning to generate a visual explanation for CNNs. Our method considers the black-box CNN model and relies solely on the probability distribution of the model’s output to localize the features contributing to a particular prediction. The proposed reinforcement learning algorithm has an agent with two actions, a forward action that explores the input image and identifies the most sensitive region to generate a localization mask, and a reverse action that fine-tunes the localization mask. We evaluate the performance of our approach using multiple image segmentation metrics and compare it with existing visualization-based methods. The experimental results demonstrate that our proposed method outperforms the existing techniques, producing more accurate localization masks of regions of interest in the input images.

查看译文

关键词

black-box CNN model,CNNs,computer vision tasks,convolutional Networks,Convolutional Neural Networks,existing visualization-based methods,forward action,healthcare,image classification,input image,localization mask,multiple image segmentation metrics,object detection,particular prediction,probability distribution,reinforcement learning algorithm,reverse action,RL-CAM,saliency maps,security,underlying features,visual explanation,visualization-based approaches

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要