Explainability in Computer Vision Comparing Grad-CAM++ to CAM, Guided Grad-CAM and Integrated Gradients on the explainability of image classification on the Mini ImageNet dataset and the Indian Diabetic Retinopathy Image Dataset

Keywords

Loading...
Thumbnail Image

Issue Date

2024-08-14

Language

en

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

DOI

Abstract

Abstract Despite the unparalleled success of convolutional neural networks in computer vision tasks, their lack of interpretability prevents their use in the medical domain. Several methods exist to visualize the important parts of the image for the neural decision making, and thereby enhancing the explainability of the arti cial neural network predictions. In this thesis we will test and compare the performance of several explainability methods: CAM, Grad-CAM++, Integrated Gradients, and Guided Grad-CAM. For the evaluation both Mini ImageNet [24] and the indian diabetic retinopathy image dataset [17] on kaggle were used. For the Mini ImageNet dataset we see opposing values for di erent metrics, which generalized from training to validation and test set. Therefore, there is not a clear best method to explain a decision made by a convolutional neural network on image data. The same holds for the indian diabetic retinopathy image dataset. The results did roughly generalize from training to validation and test set, but the results did not generalize between the two datasets. This could be due to the small size of the indian diabetic retinopathy image dataset.

Description

Citation

Faculty

Faculteit der Sociale Wetenschappen