Categorical Foundations of Explainable AI: A Unifying Theory

Barbiero, Pietro; Fioravanti, Stefano; Giannini, Francesco; Tonda, Alberto; Lio, Pietro; Di Lavore, Elena

Computer Science > Artificial Intelligence

arXiv:2304.14094 (cs)

[Submitted on 27 Apr 2023 (v1), last revised 17 Sep 2023 (this version, v3)]

Title:Categorical Foundations of Explainable AI: A Unifying Theory

Authors:Pietro Barbiero, Stefano Fioravanti, Francesco Giannini, Alberto Tonda, Pietro Lio, Elena Di Lavore

View PDF

Abstract:Explainable AI (XAI) aims to address the human need for safe and reliable AI systems. However, numerous surveys emphasize the absence of a sound mathematical formalization of key XAI notions -- remarkably including the term "explanation" which still lacks a precise definition. To bridge this gap, this paper presents the first mathematically rigorous definitions of key XAI notions and processes, using the well-funded formalism of Category theory. We show that our categorical framework allows to: (i) model existing learning schemes and architectures, (ii) formally define the term "explanation", (iii) establish a theoretical basis for XAI taxonomies, and (iv) analyze commonly overlooked aspects of explaining methods. As a consequence, our categorical framework promotes the ethical and secure deployment of AI technologies as it represents a significant step towards a sound theoretical foundation of explainable AI.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2304.14094 [cs.AI]
	(or arXiv:2304.14094v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2304.14094

Submission history

From: Pietro Barbiero [view email]
[v1] Thu, 27 Apr 2023 11:10:16 UTC (1,399 KB)
[v2] Fri, 19 May 2023 08:23:14 UTC (1,222 KB)
[v3] Sun, 17 Sep 2023 06:16:39 UTC (1,279 KB)

Computer Science > Artificial Intelligence

Title:Categorical Foundations of Explainable AI: A Unifying Theory

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Artificial Intelligence

Title:Categorical Foundations of Explainable AI: A Unifying Theory

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.