- Paper was accepted to NeurIPS 2025!
- NLMCD, TopK-SAE, and USAE added as baseline options (tested on the MNIST modification and CUB experiments).
```bash
conda create -n "RDX" python=3.10.15
conda activate RDX
bash setup.sh
```

Download the checkpoints for the MNIST experiments:

```bash
bash download_checkpoints.sh
```
- MNIST: You can download the MNIST dataset with `bash download_mnist.sh`.
- iNaturalist (Subset): You can download the iNaturalist subset with `bash download_inaturalist.sh`.
- CUB: You can download CUB and the supplementary files for the CUB CBMs with `bash download_cub.sh`.
- ImageNet: You can download ImageNet with `bash download_imagenet.sh`.
If you have already downloaded some of these datasets, you can symlink them into the `data/` directory. See `symlinks.txt` for examples.
- To reproduce the MNIST experiments, download the checkpoints and the MNIST dataset, then run:
  - `bash mnist_835_experiment.sh` for the MNIST subset experiment with only 3s, 5s, and 8s.
  - `bash mnist_modification_experiment_k=3.sh` for the MNIST training modification experiments with k=3.
- To reproduce the CUB PCBM experiments, download the CUB dataset and run `bash cub_pcbm_v_cub_masked_pcbm.sh`.
- To reproduce the ImageNet experiments, download the ImageNet dataset and run:
  - `bash dino_vs_dinov2_imagenet_ar.sh` (aligned)
  - `bash dino_vs_dinov2_imagenet.sh` (unaligned)
- To reproduce the iNaturalist experiments, download the iNaturalist subset and run:
  - `bash clip_vs_clipinat_ar.sh` (aligned)
  - `bash clip_vs_clipinat.sh` (unaligned)
The smallest dataset is the iNaturalist subset, so the fastest way to run a minimal example is to download it and run Experiment 4.
To visualize the results of the experiments, run `python analyze_explanations.py`. By default, this analyzes the iNaturalist subset (aligned) experiment (Exp. 4a). There are several commented-out function calls in the script that you can uncomment to visualize the results of the other experiments.
@article{kondapaneni2025repdiffexp,
title={Representational Difference Explanations},
author={Kondapaneni, Neehar and Mac Aodha, Oisin and Perona, Pietro},
journal={arXiv preprint arXiv:2505.23917},
year={2025}
}
Our original code seeded once at the start of the comparisons for all methods. However, we realized that this is likely to cause inconsistencies due to arbitrary choices in the order in which the different comparisons are run. The new code re-seeds at the beginning of each comparison for all methods. This may lead to slightly different results than those reported in the paper, but we have checked that the trends remain the same. We apologize for any inconvenience this may cause.
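For concreteness, here is a minimal sketch of the per-comparison re-seeding pattern; the helper names, seed value, and comparison labels below are illustrative assumptions, not the repository's actual API.

```python
import random

import numpy as np
import torch


def set_seed(seed: int = 0) -> None:
    """Reset all relevant RNGs so each comparison starts from an identical state."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)


def run_comparison(name: str) -> None:
    """Placeholder for training/evaluating one method on one comparison task."""
    print(f"running {name}")


# Re-seeding inside the loop makes the results independent of the
# (arbitrary) order in which the comparisons happen to be run.
for comparison in ["rdx", "topk_sae", "nlmcd", "usae"]:
    set_seed(0)
    run_comparison(comparison)
```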
We stayed as close as possible to the original concept selection strategy of each baseline method. However, the TopK-SAE, NLMCD, and USAE concept selection strategies were modified for a fair comparison on our comparison tasks (a rough sketch of these selection rules is given after the list below).
Let k = the number of concepts shown to the user.
- TopK-SAE uses 50 latents during training, with only the top k remaining active. TopK-SAEs are trained on each representation independently. After training, we select the k concepts per model with the largest mean activations to show to the user.
- NLMCD uses HDBSCAN clustering and generates an arbitrary number of concepts for each representation. We measure concept similarity across models and select the top k most dissimilar concepts for our comparisons.
- USAE learns an internal representation of size 8 × (representation dimension), which is much larger than k. To select k concepts, we measure the firing entropy of each concept and select the k concepts per model with the lowest firing entropy. Firing entropy is defined in the USAE paper and measures how evenly a concept activates across the different models. Low entropy indicates that a concept is more specific to certain models and is thus more likely to be useful for distinguishing them.
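The sketch below illustrates one way these selection rules can be implemented; the function names, array shapes, and exact normalizations are our assumptions for illustration, not the code used in this repository.

```python
import numpy as np


def select_topk_sae_concepts(acts: np.ndarray, k: int) -> np.ndarray:
    """TopK-SAE: acts is (n_samples, n_latents) for one model;
    return the k latents with the largest mean activation."""
    return np.argsort(acts.mean(axis=0))[::-1][:k]


def select_nlmcd_concepts(cross_model_sim: np.ndarray, k: int) -> np.ndarray:
    """NLMCD: cross_model_sim[i] is the similarity of concept i to its best
    match in the other model; return the k most dissimilar concepts."""
    return np.argsort(cross_model_sim)[:k]


def select_usae_concepts(firing_rates: np.ndarray, k: int) -> np.ndarray:
    """USAE: firing_rates is (n_concepts, n_models), how often each concept
    fires in each model; return the k concepts with the lowest firing entropy,
    i.e. the most model-specific concepts."""
    p = firing_rates / (firing_rates.sum(axis=1, keepdims=True) + 1e-12)
    entropy = -(p * np.log(p + 1e-12)).sum(axis=1)
    return np.argsort(entropy)[:k]
```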
While we feel these choices are reasonable, it is possible that different concept selection strategies may improve baseline performance. Feel free to experiment with different strategies!