This repository provides tools and models for detecting deepfake audio using machine learning techniques. The project focuses on enhancing detection robustness through explainability and feature engineering.
- custom_data.py: Script to load and preprocess custom datasets.
- features.py: Feature extraction methods for audio analysis.
- transformer.py: Implementation of a transformer-based model for classification.
- xgboost.py: XGBoost model for baseline comparison.
- Jupyter Notebooks: Analysis, feature importance, and explainability.
- Preprocess the data using
custom_data.py. - Extract features using
features.py. - Train and evaluate models using
transformer.pyandxgboost.py. - Analyze results and model explainability using the provided notebooks.
- Explainability tooling found in
.ipynbfiles.
Contributions are welcome! Please submit issues or pull requests for improvements.