Causal analysis for robust interpretability of neural networks

Explainability and interpretability play an important role for adopting deep neural networks. Through analyzing the effect of path interventions at various nodes on model's performance, we are able to reveal the causal mechanisms within hidden layers and isolate the relevant components from noisy ones.

This repository contains the material used to obtain the results in our paper with LeNet trained on the MNIST dataset.

Prerequisites

Install python 3:

sudo apt update
sudo apt install python3
sudo apt install python3-pip

Install poetry :

curl -sSL https://raw.githubusercontent.com/python-poetry/poetry/master/get-poetry.py | python -

Installation

Clone the repo and go at the root

 git clone https://github.com/annonym/ixnn.git && cd ixnn

Setup poetry

 poetry install
 # The next part is to setup jupyter with poetry
 poetry run jupyter contrib nbextension install --user
 poetry run jupyter nbextensions_configurator enable --user
 poetry run ipython kernel install --user --name=explainnn

Launch example

To launch the main script simply run

poetry run python src/explainnn/main.py

A step by step demonstration is available in the jupyter-notebook

Config file

config.yaml contains all the parameters for the main script :

device : wether to run the script on CPU or GPU (⚠️ : if you want to use the GPU you have to install PyTorch alongside CUDA, see : https://pytorch.org/get-started/locally/)
dataset_name : the name of the dataset to use
model_name : the name of the model to use
learn_explainer : wether to generate the causal graph or not
target_idx : the index of the targeted class
n_samples : the number of samples used to generate the causal graph
soft_interventions : soft or hard interventions
graph_stab : wether to test graph stability
gen_attr : wether to generate attributions
save_attr : wether to save the generated attributions
vis_attr : wether to plot the attributions
eval_attr : wether to generate the metrics file
baseline_attr : wether to plot the baseline methods attributions
layer_name : the name of the layer used to generate attributions
layer_name_soft : the name of the layer used to test graph stability

For evaluation metrics and comparison with traditional attributions methods we used the quantus library

For other methods that don't exist in quantus library refer to this file

TO-DO

Add implementations on other architectures (ResNet18, ResNet50, ConvNext, ...) and datasets (MiniImageNet)

LICENSE

Licensed under Apache 2.0 License.

License will be released upon paper review completion

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
dataset		dataset
model		model
output		output
src/explainnn		src/explainnn
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
demonstration.ipynb		demonstration.ipynb
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Causal analysis for robust interpretability of neural networks

Prerequisites

Installation

Launch example

Config file

TO-DO

LICENSE

About

Uh oh!

Releases

Packages

Languages

OlaAhmad/ixnn

Folders and files

Latest commit

History

Repository files navigation

Causal analysis for robust interpretability of neural networks

Prerequisites

Installation

Launch example

Config file

TO-DO

LICENSE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages