red-paper

Code and supporting materials for the AAAI 2022 RED paper

This repository contains all the source codes to reproduce the experimental results reported in paper "Detecting Misclassification Errors in Neural Networks with a Gaussian Process Model", which is published in AAAI 2022. (Arxiv Link: https://arxiv.org/abs/2010.02065)

Steps to reproduce the main results in the paper:

Before running the source code, download the 121 UCI datasets from https://github.com/bioinf-jku/SNNs, and the 4 newly added datasets from UCI Machine Learning Repository, namely, "Phishing","messidor","Bioconcentration", and "Climate".
Put the 121 UCI datasets into a folder named UCI121_data, and the 4 newly added dataset into a folder named Datasets.
Create a folder named Results to store all the experimental results.
Create a folder named Statistics to store all the analysis results.
pip install the packages in requirements.txt.
Run main_experiments_UCI.py to generate all the experimental results.
Run analysis_results_UCI.py to analyze the stored experimental results.

Note: the packages used in calculating Trust Score can be git cloned from https://github.com/google/TrustScore.

Steps to reproduce other experimental results in the paper:

For results of "SVGP" variant, run experiments_SVGP.py after obtaining the main results in Results folder.
For results of "DNGO" variant, run experiments_DNGO.py after obtaining the main results in Results folder. The source codes of DNGO can be git cloned from https://github.com/automl/pybnn/blob/master/pybnn/dngo.py.
For results of "Entropy" variant, it can be directly calculated from the outputs of original NN classifier using entropy function from scipy.stats.
For results regarding Bayesian Neural Networks, run experiments_BNN.py to generate the results of base BNN classifier, then run experiments_BNN+RED.py to get the results of applying RED on top of BNN classifiers.
For results regarding MC-dropout, run experiments_dropout.py to generate the results of base NN classifier with MC-dropout, then run experiments_dropout+RED.py to get the results of applying RED on top of NN classifiers with MC-dropout.
For results regarding VGG16/VGG19 on CIFAR-10/CIFAR-100, train the base VGG16/VGG19 classifiers using code from https://github.com/geifmany/cifar-vgg, then run the included codes that generate results on UCI datasets on CIFAR-10/CIFAR-100 results.
For results regarding OOD and adversarial samples, run experiments_OOD_adversarial.py.
For results regarding OOD detection in CIFAR-10 vs. SVHN, directly apply the trained VGG16 and RED models in step 6 to SVHN data downloaded from http://ufldl.stanford.edu/housenumbers/.
For results of BLR-residual, run experiments_BLR-residual.py after running main_experiments_UCI.py.

Citation

If you use RED in your research, please cite it using the following BibTeX entry

@article{qiu:aaai22, 
         title={Detecting Misclassification Errors in Neural Networks with a Gaussian Process Model}, 
         volume={36}, 
         url={https://ojs.aaai.org/index.php/AAAI/article/view/20773}, 
         DOI={10.1609/aaai.v36i7.20773}, 
         number={7}, 
         journal={Proceedings of the AAAI Conference on Artificial Intelligence}, 
         author={Qiu, Xin and Miikkulainen, Risto}, 
         year={2022}, 
         month={Jun.}, 
         pages={8017-8027} }

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

red-paper

Steps to reproduce the main results in the paper:

Steps to reproduce other experimental results in the paper:

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
analysis_results_UCI.py		analysis_results_UCI.py
experiments_BLR-residual.py		experiments_BLR-residual.py
experiments_BNN+RED.py		experiments_BNN+RED.py
experiments_BNN.py		experiments_BNN.py
experiments_DNGO.py		experiments_DNGO.py
experiments_OOD_adversarial.py		experiments_OOD_adversarial.py
experiments_SVGP.py		experiments_SVGP.py
experiments_dropout+RED.py		experiments_dropout+RED.py
experiments_dropout.py		experiments_dropout.py
main_experiments_UCI.py		main_experiments_UCI.py
requirements.txt		requirements.txt
util.py		util.py
util_DNGO.py		util_DNGO.py

Folders and files

Latest commit

History

Repository files navigation

red-paper

Steps to reproduce the main results in the paper:

Steps to reproduce other experimental results in the paper:

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages