Learning to Scaffold: Optimizing Model Explanations for Teaching

Fernandes, Patrick; Treviso, Marcos; Pruthi, Danish; Martins, André F. T.; Neubig, Graham

Computer Science > Machine Learning

arXiv:2204.10810 (cs)

[Submitted on 22 Apr 2022 (v1), last revised 30 Nov 2022 (this version, v2)]

Title:Learning to Scaffold: Optimizing Model Explanations for Teaching

Authors:Patrick Fernandes, Marcos Treviso, Danish Pruthi, André F. T. Martins, Graham Neubig

View PDF

Abstract:Modern machine learning models are opaque, and as a result there is a burgeoning academic subfield on methods that explain these models' behavior. However, what is the precise goal of providing such explanations, and how can we demonstrate that explanations achieve this goal? Some research argues that explanations should help teach a student (either human or machine) to simulate the model being explained, and that the quality of explanations can be measured by the simulation accuracy of students on unexplained examples. In this work, leveraging meta-learning techniques, we extend this idea to improve the quality of the explanations themselves, specifically by optimizing explanations such that student models more effectively learn to simulate the original model. We train models on three natural language processing and computer vision tasks, and find that students trained with explanations extracted with our framework are able to simulate the teacher significantly more effectively than ones produced with previous methods. Through human annotations and a user study, we further find that these learned explanations more closely align with how humans would explain the required decisions in these tasks. Our code is available at this https URL

Comments:	10 pages. NeurIPS 2022
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.10810 [cs.LG]
	(or arXiv:2204.10810v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.10810

Submission history

From: Marcos Vinícius Treviso [view email]
[v1] Fri, 22 Apr 2022 16:43:39 UTC (3,449 KB)
[v2] Wed, 30 Nov 2022 03:02:03 UTC (1,968 KB)

Computer Science > Machine Learning

Title:Learning to Scaffold: Optimizing Model Explanations for Teaching

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Machine Learning

Title:Learning to Scaffold: Optimizing Model Explanations for Teaching

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.