Abs-CAM: a gradient optimization interpretable approach for explanation of convolutional neural networks

Zeng, Chunyan; Yan, Kang; Wang, Zhifeng; Yu, Yan; Xia, Shiyan; Zhao, Nan

doi:10.1007/s11760-022-02313-0

Original Paper
Published: 27 July 2022

Volume 17, pages 1069–1076, (2023)

Signal, Image and Video Processing

Chunyan Zeng¹,
Kang Yan¹,
Zhifeng Wang ORCID: orcid.org/0000-0001-6960-509X²,
Yan Yu¹,
Shiyan Xia¹ &
Nan Zhao¹

Abstract

The black-box nature of deep neural networks severely hinders both their performance improvement and their application in specific scenarios. In recent years, class activation mapping (CAM)-based methods have been widely used to interpret the internal decisions of models in computer vision tasks. However, when these methods use backpropagation to obtain gradients, the gradients introduce noise into the saliency map and can even highlight features that are irrelevant to the decision. In this paper, we propose an absolute value class activation mapping-based (Abs-CAM) method, which optimizes the gradients derived from backpropagation by turning all of them into positive gradients, in order to enhance the visual features of the output neurons’ activation and improve the localization ability of the saliency map. The framework of Abs-CAM is divided into two phases: generating the initial saliency map and generating the final saliency map. The first phase improves the localization ability of the saliency map by optimizing the gradients, and the second phase linearly combines the initial saliency map with the original image to enhance the semantic information of the saliency map. We evaluate the proposed method both qualitatively and quantitatively, using the Deletion, Insertion, and Pointing Game metrics. The experimental results show that Abs-CAM visibly removes noise from the saliency map, better locates the features related to the decision, and outperforms previous methods in recognition and localization tasks.
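The first phase described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes that the activations and the class-score gradients of the last convolutional layer are already available as arrays of shape (K, H, W), that channel weights are obtained by spatially averaging the gradients (as is common in gradient-based CAM variants), and that the map is passed through a ReLU and min-max normalized. The function name `cam_from_gradients` and the toy arrays are hypothetical. The second phase (combining the initial map with the input image) is not covered here.

```python
import numpy as np


def cam_from_gradients(activations, gradients, use_abs=True):
    """Build a saliency map from conv activations and class-score gradients.

    activations, gradients: arrays of shape (K, H, W).
    use_abs=True takes the absolute value of every gradient before
    averaging (the idea named in the abstract); use_abs=False keeps the
    signed gradients for comparison.
    """
    grads = np.abs(gradients) if use_abs else gradients
    # One weight per channel: spatial mean of the (absolute) gradients.
    weights = grads.mean(axis=(1, 2))                  # shape (K,)
    # Weighted sum of the activation maps over channels.
    cam = np.tensordot(weights, activations, axes=1)   # shape (H, W)
    cam = np.maximum(cam, 0.0)                         # ReLU
    value_range = cam.max() - cam.min()
    if value_range > 0:
        cam = (cam - cam.min()) / value_range          # scale to [0, 1]
    return cam


# Toy example: two 2x2 channels (made-up numbers, for illustration only).
activations = np.array([
    [[1.0, 0.0], [0.0, 0.0]],   # channel 0 fires at the top-left
    [[0.0, 0.0], [0.0, 2.0]],   # channel 1 fires at the bottom-right
])
gradients = np.array([
    [[1.0, -1.0], [1.0, -1.0]],  # mixed signs: the signed mean is 0
    [[0.5, 0.5], [0.5, 0.5]],
])

abs_cam = cam_from_gradients(activations, gradients, use_abs=True)
signed_cam = cam_from_gradients(activations, gradients, use_abs=False)
print(abs_cam)
print(signed_cam)
```

In this toy case the signed gradients of channel 0 cancel out, so the signed map ignores the top-left location, while the absolute-value variant keeps both locations. Whether keeping such channels helps on real networks is the paper's empirical claim, which this sketch does not test.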

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61901165, 62177022, and 61501199), the Collaborative Innovation Center for Informatization and Balanced Development of K-12 Education by MOE and Hubei Province (No. xtzd2021-005), the Self-determined Research Funds of CCNU from the Colleges’ Basic Research and Operation of MOE (No. CCNU20ZT010), and the Hubei Natural Science Foundation (No. 2017CFB683).

Author information

Authors and Affiliations

Hubei Key Laboratory for High-efficiency Utilization of Solar Energy and Operation Control of Energy Storage System, Hubei University of Technology, Wuhan, 430068, China
Chunyan Zeng, Kang Yan, Yan Yu, Shiyan Xia & Nan Zhao
Department of Digital Media Technology, Central China Normal University, Wuhan, 430079, China
Zhifeng Wang

Corresponding author

Correspondence to Zhifeng Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Zeng, C., Yan, K., Wang, Z. et al. Abs-CAM: a gradient optimization interpretable approach for explanation of convolutional neural networks. SIViP 17, 1069–1076 (2023). https://doi.org/10.1007/s11760-022-02313-0

Received: 20 January 2022
Revised: 30 June 2022
Accepted: 01 July 2022
Published: 27 July 2022
Issue Date: June 2023
DOI: https://doi.org/10.1007/s11760-022-02313-0
