Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Bianchi, Matteo; De Santis, Antonio; Tocchetti, Andrea; Brambilla, Marco

doi:10.24963/ijcai.2024/411

Computer Science > Machine Learning

arXiv:2405.03301 (cs)

[Submitted on 6 May 2024]

Title:Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Authors:Matteo Bianchi, Antonio De Santis, Andrea Tocchetti, Marco Brambilla

View PDF HTML (experimental)

Abstract:Transparency and explainability in image classification are essential for establishing trust in machine learning models and detecting biases and errors. State-of-the-art explainability methods generate saliency maps to show where a specific class is identified, without providing a detailed explanation of the model's decision process. Striving to address such a need, we introduce a post-hoc method that explains the entire feature extraction process of a Convolutional Neural Network. These explanations include a layer-wise representation of the features the model extracts from the input. Such features are represented as saliency maps generated by clustering and merging similar feature maps, to which we associate a weight derived by generalizing Grad-CAM for the proposed methodology. To further enhance these explanations, we include a set of textual labels collected through a gamified crowdsourcing activity and processed using NLP techniques and Sentence-BERT. Finally, we show an approach to generate global explanations by aggregating labels across multiple images.

Comments:	International Joint Conference on Artificial Intelligence 2024 (to be published)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.03301 [cs.LG]
	(or arXiv:2405.03301v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.03301
Journal reference:	IJCAI 2024
Related DOI:	https://doi.org/10.24963/ijcai.2024/411

Submission history

From: Antonio De Santis [view email]
[v1] Mon, 6 May 2024 09:21:35 UTC (3,192 KB)

Computer Science > Machine Learning

Title:Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Machine Learning

Title:Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.