PhD thesis: GNN XAI
Machine Learning
Location:
GREYC laboratory, CNRS UMR 6072, Université de Caen Normandie, 14000 Caen, France
Scientific context
Pandora. This thesis is funded within the Pandora project, supported by the French ANR (National Research Agency) and underway since February 2025. Pandora is situated in the context of explainable artificial intelligence (XAI) applied to graph neural networks (GNNs). By focusing on the internal functioning of GNNs, the objectives of the project are as follows:
— characterize, understand and clearly explain the internal workings of GNNs using pattern extraction techniques;
— uncover statistically significant patterns of neural activation, called “activation rules,” to determine how networks encode concepts [7, 8] (a toy sketch of this idea is given below);
— translate these activation rules into graph patterns interpretable by a user;
— use this knowledge to improve GNNs by identifying learning biases, generating
additional data, and building explanatory systems.
The thesis will be concerned with the last of those research questions.
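To give a concrete feel for what such analyses look like, the following Python sketch binarizes the hidden activations of a GNN layer and lists frequent co-activation patterns of hidden units. The random activation matrix, the activation threshold and the plain support measure are illustrative placeholders; the actual approach of [7, 8] works on activations collected from a trained GNN and uses a dedicated interestingness measure rather than raw support.

```python
# Minimal, self-contained sketch: frequent co-activation patterns of GNN hidden
# units. The activation matrix is simulated with random numbers as a stand-in
# for activations collected from a trained GNN layer, and plain support replaces
# the dedicated interestingness measure used for activation rules [7, 8].
from itertools import combinations

import numpy as np

rng = np.random.default_rng(0)
activations = rng.random((200, 16))        # 200 nodes x 16 hidden units (stand-in data)
binary = (activations > 0.7).astype(int)   # 1 = unit considered "active" on that node

min_support = 0.05                         # fraction of nodes that must satisfy the pattern
patterns = []
for size in (2, 3):                        # only small co-activation patterns
    for units in combinations(range(binary.shape[1]), size):
        support = binary[:, list(units)].all(axis=1).mean()
        if support >= min_support:
            patterns.append((units, support))

for units, support in sorted(patterns, key=lambda p: -p[1])[:5]:
    print(f"hidden units {units} co-activate on {support:.0%} of the nodes")
```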
The work carried out in this project (and by extension in the thesis) will be partially based on molecular data from biochemical experiments conducted in our collaboration with the CERMN laboratory (Centre d’Études et de Recherche sur le Médicament de Normandie), Université de Caen Normandie.
Problem setting. In machine learning, training data sets are not always sufficiently representative of the real world (for example, chemical/biological experiments often focus only on certain well-explored molecules or certain therapeutic targets). How can we detect that a training data set is insufficient? Two non-exhaustive criteria:
— some parts of the data space are not represented (e.g. certain node/edge combinations never occur; a minimal coverage check along these lines is sketched after this list);
— the learned model is unreliable in some subspaces of the data (the reliability of
a supervised model can be studied, for example, by looking at the importance of
instances in the construction of decision boundaries).
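As a minimal sketch of the first criterion, the Python snippet below enumerates the node-label pairs that occur on the edges of a toy graph dataset and reports the pairs that never appear. The label alphabet and the two graphs are hypothetical stand-ins; an analysis of molecular data would use the actual atom and bond vocabularies of the project data.

```python
# Minimal sketch: node-label combinations that never occur on any edge of the
# dataset, one symptom of an under-represented data space. The label alphabet
# and the graphs are toy placeholders, not project data.
from itertools import combinations_with_replacement

labels = {"C", "N", "O"}                                    # hypothetical node-label alphabet
graphs = [                                                  # each graph: node labels + edge list
    {"nodes": ["C", "C", "O"], "edges": [(0, 1), (1, 2)]},
    {"nodes": ["C", "N"], "edges": [(0, 1)]},
]

observed = set()
for g in graphs:
    for u, v in g["edges"]:
        observed.add(frozenset((g["nodes"][u], g["nodes"][v])))

possible = {frozenset(pair) for pair in combinations_with_replacement(sorted(labels), 2)}
missing = possible - observed
print("edge label combinations never observed:",
      sorted(tuple(sorted(pair)) for pair in missing))
```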
The literature contains methods to characterize data in a model-independent manner [5]
and methods to characterize the behavior of a model based on the components of the
individual graphs considered [9, 2, 6, 3, 4, 1]. However, there is no approach that establishes
the link between the data and the performance of a specific model. Furthermore, there are no approaches that augment the data as a means of improving model performance and reliability. The thesis is intended to address these gaps.
Objectives
This thesis has three objectives. First, we want to characterize graph datasets at a global level, in a way similar to what is already done for vectorial datasets. Second, we want to design one (or more) approaches that use explanations of the behavior of GNNs to identify relevant instances of the training set. Finally, we will leverage the results of the first two points to generate additional data instances that improve the data set and thereby render GNNs more accurate and more robust.
instances and subgraphs linked to the explanatory descriptors/rules will also make it possible to determine how the descriptors characterize different subsets of data.
3. Develop a formalism to extend concepts defined for vector data (density, decision boundaries, value distribution) to graph data. This formalism, in combination with the results of step 2, will make it possible to determine where learning instances are missing in a training dataset and thus where it is useful to generate synthetic data (a toy illustration is sketched after this list).
4. Exploit the information derived from the first three points, as well as others —
for instance graph patterns extracted using pattern mining methods — to define
constraints on symbolic data generators to arrive at data with precise properties
that fill the gaps in the data sets.
5. Evaluate the generated data in the context of project use cases, particularly activity prediction on molecular data.
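To give point 3 a concrete, if simplistic, shape: the Python sketch below maps every graph of a randomly generated dataset to a few descriptor values and fits a kernel density estimate over those vectors, so that graphs lying in low-density regions indicate where training instances are scarce. The descriptors, the scikit-learn KernelDensity estimator and the bandwidth are placeholder choices, not the graph-native formalism the thesis is expected to develop.

```python
# Minimal sketch: locate sparse regions of a graph dataset by estimating a
# density over simple per-graph descriptors. Descriptors and estimator are
# illustrative stand-ins for the formalism to be developed in the thesis.
import networkx as nx
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(1)
graphs = [nx.gnp_random_graph(int(rng.integers(5, 15)), 0.3, seed=int(s))
          for s in rng.integers(0, 1000, 50)]               # toy dataset of 50 random graphs

def describe(g: nx.Graph) -> list:
    """Toy descriptor: size, edge density, mean degree, average clustering."""
    degrees = [d for _, d in g.degree()]
    return [g.number_of_nodes(), nx.density(g),
            float(np.mean(degrees)), nx.average_clustering(g)]

X = np.array([describe(g) for g in graphs])
X = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-9)           # standardize descriptors

kde = KernelDensity(bandwidth=0.5).fit(X)
log_density = kde.score_samples(X)                           # log-density of each graph's descriptor
sparsest = np.argsort(log_density)[:3]                       # graphs in the emptiest regions
print("graphs in the sparsest regions of descriptor space:", sparsest.tolist())
```

Graph kernels or learned GNN embeddings could replace the hand-crafted descriptors, but the underlying question of what “density” means for graph data is precisely what the thesis is expected to address.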
Keywords: Statistical learning, graph neural networks, explainable AI, data mining.
Thesis period: starting in autumn 2025.
Remuneration: approximately €2,200 gross per month.
Supervising team:
— Bruno Crémilleux (GREYC – Université de Caen Normandie).
— Marc Plantevit (LRE – EPITA).
— Albrecht Zimmermann (GREYC – Université de Caen Normandie).
Candidate profile
The candidate must be enrolled in the final year of a Master’s degree or an engineering
degree, or hold such a degree, in a field related to computer science or applied mathematics,
and have solid programming skills. Experience in data science, deep learning, etc. would be a plus. The candidate must be able to write scientific reports and communicate research results at conferences in English.
To apply
Application period: from now until the position is filled.
Send the following documents (exclusively in PDF format) to bruno.cremilleux@unicaen.fr, marc.plantevit@epita.fr and albrecht.zimmermann@unicaen.fr:
— a cover letter explaining your qualifications, experience and motivation for this subject;
— a curriculum vitae;
— transcripts of grades (if possible with ranking) for the 3rd year of the Bachelor's degree and the 1st and 2nd years of the Master's degree, or equivalent for engineering schools;
— if possible, the names of people (teachers or others) who can provide information on your skills and your work;
— a link to personal project repositories (e.g. GitHub);
— any other information you consider useful.
References
[1] C. Abrate, G. Preti, and F. Bonchi. Counterfactual explanations for graph classification through the lenses of density. In World Conference on Explainable Artificial Intelligence, pages 324–348. Springer, 2023.
[2] A. Duval and F. D. Malliaros. GraphSVX: Shapley value explanations for graph neural networks. In Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2021, Bilbao, Spain, September 13–17, 2021, Proceedings, Part II 21, pages 302–318. Springer, 2021.
[3] Q. Huang, M. Yamada, Y. Tian, D. Singh, and Y. Chang. GraphLIME: Local interpretable model explanations for graph neural networks. IEEE Transactions on Knowledge and Data Engineering, 35(7):6968–6972, 2022.
[4] A. Mastropietro, G. Pasculli, C. Feldmann, R. Rodríguez-Pérez, and J. Bajorath. EdgeSHAPer: Bond-centric Shapley value-based explanation method for graph neural networks. iScience, 25(10), 2022.
[5] M. A. Munoz, L. Villanova, D. Baatar, and K. Smith-Miles. Instance spaces for machine learning classification. Machine Learning, 107(1):109–147, 2018.
[6] A. Perotti, P. Bajardi, F. Bonchi, and A. Panisson. GraphSHAP: Explaining identity-aware graph classifiers through the language of motifs. arXiv preprint arXiv:2202.08815, 2022.
[7] L. Veyrin-Forrer, A. Kamal, S. Duffner, M. Plantevit, and C. Robardet. In pursuit of the hidden features of GNN's internal representations. Data & Knowledge Engineering, 142:102097, 2022.
[8] L. Veyrin-Forrer, A. Kamal, S. Duffner, M. Plantevit, and C. Robardet. On GNN explainability with activation rules. Data Mining and Knowledge Discovery, pages 1–35, 2022.
[9] H. Yuan, H. Yu, J. Wang, K. Li, and S. Ji. On explainability of graph neural networks via subgraph explorations. In M. Meila and T. Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pages 12241–12252. PMLR, 18–24 Jul 2021.