pENC: Parallel Audio Feature Extraction and Classification

Overview

pENC is a high-performance, hybrid C++/Python audio processing toolkit designed for large-scale, parallel feature extraction and classification of audio files. It leverages OpenCL for GPU acceleration, Cython for CPU parallelism, and Python for orchestration and machine learning, making it ideal for research and production environments where speed and flexibility are critical.

Features

Blazing Fast MFCC Extraction: Utilizes clFFT and OpenCL for GPU-accelerated FFT and power spectrum computation.
Cython and C++ Hybrid: Seamlessly switches between Cython (CPU) and C++/OpenCL (GPU) for optimal performance.
Batch Audio Processing: Efficiently processes thousands of audio files with minimal CPU and GPU idle time.
Custom OpenCL Kernels: Includes custom kernels for Hamming window, mel filterbank, and DCT operations.
Python Orchestration: Easy-to-use Python interface for feature extraction, model training, and evaluation.
Machine Learning Ready: Integrates with scikit-learn for training and evaluating classifiers.
Flexible Build System: CMake-based build with Visual Studio and CMake Tools extension support.
Extensive Dataset Support: Designed to handle large datasets with robust error handling and caching.

Project Structure

pENC/
  ├── classifier.py           # Main Python script for feature extraction and classification
  ├── mfcc_extractor.cpp      # C++/OpenCL backend for MFCC extraction
  ├── mfcc_kernel.cl          # OpenCL kernels for DSP operations
  ├── logic.py, shortcut.py   # Supporting Python modules
  ├── setup.py                # Cython build script
  ├── CMakeLists.txt          # CMake build configuration
  ├── dataset/                # Audio dataset and metadata
  ├── build/                  # Build artifacts
  ├── LICENSE                 # License file
  └── README.md               # This file

Requirements

Python 3.13
Cython
OpenCL 2.0+ compatible GPU (tested on Intel UHD Graphics 600)
clFFT library (for GPU FFT)
Visual Studio 2022 (for C++/CMake build)
scikit-learn, numpy, pandas, soundfile, joblib (Python dependencies)

Installation & Build

Install Python 3.13 and dependencies:

pip install cython numpy pandas scikit-learn soundfile joblib

Install clFFT:
- Download from clFFT GitHub
- Extract and add clFFT/bin to your PATH or use os.add_dll_directory in Python (already handled in classifier.py)
Build C++/Cython extensions:
- Use the CMake Tools extension in VS Code or run:
```
cmake -S . -B build
cmake --build build --config Release
```

Usage

Place your audio files and metadata in the dataset/ directory.
Run the main pipeline:
```
python classifier.py
```
The script will extract MFCC features, train a classifier, and report accuracy.

Customization

Switch between Debug/Release DLLs: The Python code automatically selects the correct DLL based on how you run (F5 for Debug, Ctrl+F5 for Release).
Add new features: Extend mfcc_extractor.cpp or mfcc_kernel.cl for custom DSP operations.
Dataset: Update dataset/Classifier.csv and place audio files in dataset/classifier/.

License

This project is licensed to Sayed Arham Abbas Rizvi, 2025. See LICENSE for details.

Developed by Sayed Arham Abbas Rizvi. For research and private use only.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pENC: Parallel Audio Feature Extraction and Classification

Overview

Features

Project Structure

Requirements

Installation & Build

Usage

Customization

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.vscode		.vscode
Scripts		Scripts
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
classifier.py		classifier.py
dataset.lnk		dataset.lnk
inverse_wave.pyx		inverse_wave.pyx
logic.py		logic.py
mfcc_extractor.cpp		mfcc_extractor.cpp
mfcc_kernel.cl		mfcc_kernel.cl
setup.py		setup.py
shortcut.py		shortcut.py
visualization.py		visualization.py

License

Arham-Abbas/pENC

Folders and files

Latest commit

History

Repository files navigation

pENC: Parallel Audio Feature Extraction and Classification

Overview

Features

Project Structure

Requirements

Installation & Build

Usage

Customization

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages