GraFPrint: A GNN-Based Approach for Audio Identification

This is the official repository for GraFPrint, our state-of-the-art audio identification framework based on graph neural networks (GNNs). This README demonstrates how to use the code for training, audio fingerprint generation, and evaluation. For more details, refer to our paper at the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2025.

Installation Guide

Clone the repository:

git clone https://github.com/username/GraFP.git
cd GraFP

Install the required Python packages:
```
pip install -r requirements.txt
```

Training Setup

As per our experiments, we recommend using the fma-small subset of the Free Music Archive (FMA) dataset. For the noise and room impulse response (RIR) dataset, we recommend using the MUSAN dataset and the Aachen Impulse Response database, respectively.

Set up the config files with the paths to the datasets:

python setup_config.py --train_dir /PATH/TO/TRAIN/DATA --val_dir /PATH/TO/VALIDATION/DATA --noise_dir /PATH/TO/NOISE/DATA --ir_dir /PATH/TO/IR/DATA
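
A mistyped dataset path is easy to miss until training starts, so it can help to check the directories before calling the setup script. The following is a minimal sketch, not part of the repository: `build_setup_command` and `REQUIRED_FLAGS` are hypothetical names, and only the four flags shown in the command above are assumed to exist.

```python
import sys
from pathlib import Path

# Flags taken from the setup_config.py command shown above.
REQUIRED_FLAGS = ("train_dir", "val_dir", "noise_dir", "ir_dir")


def build_setup_command(paths):
    """Check the dataset directories and return the argv list for setup_config.py.

    `paths` maps each flag name (without the leading dashes) to a directory.
    This helper is hypothetical and only assembles the command; it does not run it.
    """
    missing_flags = [flag for flag in REQUIRED_FLAGS if flag not in paths]
    if missing_flags:
        raise KeyError("Missing flags: " + ", ".join(missing_flags))

    missing_dirs = [
        f"--{flag}: {paths[flag]}"
        for flag in REQUIRED_FLAGS
        if not Path(paths[flag]).is_dir()
    ]
    if missing_dirs:
        raise FileNotFoundError("Not a directory: " + "; ".join(missing_dirs))

    argv = [sys.executable, "setup_config.py"]
    for flag in REQUIRED_FLAGS:
        # Resolve to an absolute path so the config does not depend on the cwd.
        argv += [f"--{flag}", str(Path(paths[flag]).resolve())]
    return argv
```

The returned list can be passed to `subprocess.run(argv, check=True)` from the repository root.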

Run the training script:
```
python train.py
```

Generate Fingerprints

We provide a helper script for generating audio fingerprints for a given audio dataset. The pre-trained models are available here. The primary evaluation benchmarks were computed using model_tc_29_best.pth.

python generate.py --test_dir /PATH/TO/TEST/DATA --ckp /PATH/TO/MODEL

Evaluation Setup

For reproducibility, we have made the dummy fingerprint database available here. The fingerprint retrieval pipeline uses the FAISS library for approximate nearest-neighbour (ANN) search in the fingerprint embedding space. Further details about the ANN implementation are available in our pre-print. The icassp.sh script runs the evaluation pipeline and reproduces the published results.
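
To make the retrieval step concrete, here is a minimal sketch of nearest-neighbour search over fingerprint embeddings. It is an exact linear scan in plain Python, not the FAISS-based pipeline used in this repository; an ANN index approximates the same ranking while avoiding a full scan of a large database. The function names and the use of cosine similarity (inner product of L2-normalized vectors) are assumptions made for illustration.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def search(database, query, k=1):
    """Return the k (index, similarity) pairs most similar to the query.

    Similarity is the inner product of L2-normalized vectors, i.e. cosine
    similarity. This is an assumed metric, chosen only for the sketch.
    """
    q = l2_normalize(query)
    scored = []
    for idx, fingerprint in enumerate(database):
        fp = l2_normalize(fingerprint)
        scored.append((idx, sum(a * b for a, b in zip(q, fp))))
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy database of three 3-dimensional "fingerprints".
database = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.6, 0.8, 0.0]]
query = [0.5, 0.9, 0.1]
best_index, best_score = search(database, query, k=1)[0]
```

In this toy example the query is closest to the third database entry, so `best_index` is 2.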

Download and extract the test dataset. The script supports evaluation on both the fma-medium and fma-large datasets. Note that extracting the compressed fma_large.zip can take a while. For quicker evaluation runs, we recommend extracting the fma_medium.zip.
Download and extract the augmentation dataset. Queries are created using a subset of the background noise and impulse response datasets. They can be downloaded here.
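
As an illustration of how a distorted query can be built from a clean excerpt, a noise clip, and an impulse response, the sketch below convolves the clean signal with the impulse response and then adds noise at a chosen signal-to-noise ratio (SNR). This is a plain-Python sketch under stated assumptions, not the repository's augmentation code: the function names, the order of operations (reverberation first, then noise), and the SNR values are all hypothetical.

```python
import math


def convolve(signal, impulse_response):
    """Full linear convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def power(samples):
    """Mean squared amplitude of a signal."""
    return sum(v * v for v in samples) / len(samples)


def mix_at_snr(clean, noise, snr_db):
    """Add noise to `clean`, scaled so the signal-to-noise ratio is `snr_db`."""
    # Loop the noise clip so it covers the whole clean signal.
    noise = [noise[i % len(noise)] for i in range(len(clean))]
    p_clean, p_noise = power(clean), power(noise)
    if p_noise == 0:
        return list(clean)
    # Choose gain so that p_clean / (gain^2 * p_noise) == 10^(snr_db / 10).
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [c + gain * n for c, n in zip(clean, noise)]


def make_query(clean, noise, impulse_response, snr_db):
    """Apply reverberation, trim to the original length, then add noise."""
    reverberant = convolve(clean, impulse_response)[: len(clean)]
    return mix_at_snr(reverberant, noise, snr_db)
```

For real audio, the nested-loop convolution would normally be replaced by an FFT-based routine; it is kept here only to keep the example dependency-free.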

Run the evaluation script with the pre-trained model:

bash icassp.sh /PATH/TO/EVAL/DATASET /PATH/TO/AUG/DATASET

Note that the evaluation dataset path passed to the above script should be the absolute path to the directory called fma_medium or fma_large. Logs such as raw outputs and retrieval hit-rates can be found in the logs/store directory. Each output run is organized according to the filename of the pre-trained model used. Support for running evaluation on private datasets will be made available soon.
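
For readers unfamiliar with the metric, a retrieval hit-rate can be read as the percentage of queries for which the correct track appears among the top-k retrieved candidates. The sketch below shows that computation on toy data. It is an assumed, simplified definition for illustration; the exact criterion used by the evaluation script (for example, any tolerance on the matched position) may differ, and `top_k_hit_rate` is a hypothetical name.

```python
def top_k_hit_rate(retrieved, ground_truth, k=1):
    """Percentage of queries whose true ID is among the first k retrieved IDs.

    `retrieved` is a list of ranked candidate-ID lists, one per query, and
    `ground_truth` holds the correct ID for each query in the same order.
    """
    if len(retrieved) != len(ground_truth):
        raise ValueError("retrieved and ground_truth must have the same length")
    if not ground_truth:
        return 0.0
    hits = sum(
        1
        for candidates, truth in zip(retrieved, ground_truth)
        if truth in candidates[:k]
    )
    return 100.0 * hits / len(ground_truth)


# Toy example: three queries with two ranked candidates each.
retrieved = [["a", "b"], ["c", "a"], ["b", "c"]]
ground_truth = ["a", "a", "c"]
top1 = top_k_hit_rate(retrieved, ground_truth, k=1)
top2 = top_k_hit_rate(retrieved, ground_truth, k=2)
```

Here only the first query is correct at rank 1, while all three are correct within the top 2.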

Citation

If you use the code in this repository, please cite our paper:

@inproceedings{grafprint2025,
  title={GraFPrint: A GNN-Based Approach for Audio Identification},
  author={Bhattacharjee, Aditya and Singh, Shubhr and Benetos, Emmanouil},
  booktitle={ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={1--5},
  year={2025},
  organization={IEEE}
}

