Acousti-Scan RS

A Rust-based audio fingerprinting and recognition system that identifies songs, builds music libraries, and provides real-time audio analysis. It pairs a Rust backend with a Next.js web client.

📋 Table of Contents

🎵 Overview
✨ Features
🛠️ Tech Stack
📦 Installation
🚀 Usage
📡 API Reference
🏗️ Project Architecture
🧪 Development
🤝 Contributing
📄 License
🎯 Roadmap
📞 Contact & Support

🎵 Overview

Acousti-Scan RS is a comprehensive audio fingerprinting solution that combines the performance of Rust with the flexibility of a modern web interface. The system can analyze audio files, create unique fingerprints, and match unknown audio against a database of known songs. It features both a web-based client interface and a REST API for programmatic access.

The project implements Shazam-like recognition using spectrogram analysis, spectral peak detection, and hash-based fingerprinting.
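To make the spectrogram step concrete, here is a minimal sketch of a time-frequency representation using only the Rust standard library. It is not the project's implementation: the function names, the Hann window, and the naive O(n²) DFT are assumptions chosen for readability, and a real build would normally use an FFT.

```rust
use std::f32::consts::PI;

/// Magnitude spectrum of one frame using a Hann window and a naive DFT.
/// Returns n/2 bins; bin k corresponds to k * sample_rate / n Hz.
fn magnitude_spectrum(frame: &[f32]) -> Vec<f32> {
    let n = frame.len();
    (0..n / 2)
        .map(|k| {
            let (mut re, mut im) = (0.0f32, 0.0f32);
            for (t, &x) in frame.iter().enumerate() {
                // Hann window reduces spectral leakage at the frame edges.
                let w = 0.5 - 0.5 * (2.0 * PI * t as f32 / n as f32).cos();
                // Reduce k*t modulo n first to keep f32 precision.
                let angle = -2.0 * PI * ((k * t) % n) as f32 / n as f32;
                re += w * x * angle.cos();
                im += w * x * angle.sin();
            }
            (re * re + im * im).sqrt()
        })
        .collect()
}

/// Spectrogram: one magnitude spectrum per frame; frames advance by `hop`.
/// Both `frame_len` and `hop` must be non-zero.
fn spectrogram(samples: &[f32], frame_len: usize, hop: usize) -> Vec<Vec<f32>> {
    samples
        .windows(frame_len)
        .step_by(hop)
        .map(magnitude_spectrum)
        .collect()
}
```

Each inner vector is one time slice, so `spectrogram(&samples, 1024, 512)` would give overlapping 1024-sample frames; those sizes are illustrative, not values taken from the project.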

✨ Features

Core Functionality

🎯 Audio Identification: Identify songs from audio files or recordings with high accuracy
📚 Library Management: Build and maintain a comprehensive music database
🎤 Real-time Recognition: Process live audio input for instant song identification
📊 Audio Analysis: Generate spectrograms and analyze audio characteristics
🔗 YouTube Integration: Automatic metadata fetching and YouTube ID resolution
☁️ Spotify Integration: Download and process songs directly from Spotify URLs

Technical Features

⚡ High Performance: Rust-powered backend for optimal processing speed
🌐 REST API: Complete HTTP API for integration with external applications
💾 SQLite Database: Lightweight, embedded database for fingerprint storage
🎨 Modern UI: React-based web interface with responsive design
📱 Mobile Support: Touch-friendly interface optimized for mobile devices (Upcoming)
🔄 Real-time Processing: WebSocket support for live audio (Upcoming)

Audio Processing

FFT Analysis: Fast Fourier Transform for frequency domain analysis
Peak Detection: Advanced algorithms for identifying spectral peaks
Fingerprint Generation: Robust hash-based audio fingerprinting
Multi-format Support: MP3, WAV, FLAC, and other common audio formats
Noise Resilience: Handles compressed and noisy audio inputs
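As an illustration of the peak detection step, the sketch below marks a spectrogram cell as a peak when it is loud enough and strictly louder than every neighbour in a small time-frequency window. The name `find_peaks`, the `radius` and `min_mag` parameters, and the square neighbourhood are assumptions for this example; the project's own algorithm may differ.

```rust
/// A spectral peak: (frame index, frequency bin).
type Peak = (usize, usize);

/// Returns every cell that is >= `min_mag` and strictly greater than all
/// neighbours within `radius` cells in both time and frequency.
/// Assumes a rectangular spectrogram (all frames have the same length).
fn find_peaks(spec: &[Vec<f32>], radius: usize, min_mag: f32) -> Vec<Peak> {
    let mut peaks = Vec::new();
    for (t, frame) in spec.iter().enumerate() {
        for (f, &mag) in frame.iter().enumerate() {
            if mag < min_mag {
                continue;
            }
            // Clamp the neighbourhood to the spectrogram bounds.
            let t_lo = t.saturating_sub(radius);
            let t_hi = (t + radius).min(spec.len() - 1);
            let f_lo = f.saturating_sub(radius);
            let f_hi = (f + radius).min(frame.len() - 1);
            let is_max = (t_lo..=t_hi).all(|tt| {
                (f_lo..=f_hi).all(|ff| (tt, ff) == (t, f) || spec[tt][ff] < mag)
            });
            if is_max {
                peaks.push((t, f));
            }
        }
    }
    peaks
}
```

The resulting list of (time, frequency) points is what is usually called a constellation map. Keeping only local maxima is one reason this style of fingerprint tolerates noise: quieter content is dropped while the dominant peaks tend to remain.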

🛠️ Tech Stack

Backend (Rust)

Tokio - Asynchronous runtime
Actix Web - HTTP server framework
SQLite - Embedded database
FFmpeg - Audio processing and conversion
Serde - Serialization framework
Reqwest - HTTP client

Frontend (Next.js)

Next.js 14 - React framework
TypeScript - Type-safe JavaScript
Tailwind CSS - Utility-first CSS framework
Radix UI - Headless UI components
Lucide Icons - Modern icon library

Audio Processing

Custom Shazam-style Algorithm - In-house fingerprinting implementation
Spectrogram Analysis - Time-frequency domain processing
Peak Extraction - Constellation map generation
Hash Fingerprinting - Robust audio signatures
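The hash fingerprinting idea can be sketched as follows: each peak (the anchor) is paired with a few later peaks, and each pair is packed into one integer together with the time gap between them. The 10/10/12 bit layout, the `fan_out` parameter, and both function names are assumptions for illustration, not the project's actual format.

```rust
/// Packs an (anchor bin, target bin, frame delta) triple into a 32-bit hash.
/// Bit layout (an assumption for this sketch): 10 bits | 10 bits | 12 bits.
fn pack_hash(anchor_bin: u32, target_bin: u32, delta: u32) -> u32 {
    ((anchor_bin & 0x3FF) << 22) | ((target_bin & 0x3FF) << 12) | (delta & 0xFFF)
}

/// Pairs each peak with up to `fan_out` later peaks and returns
/// (hash, anchor frame) entries.
/// `peaks` holds (frame, bin) tuples and must be sorted by frame index.
fn fingerprint(peaks: &[(u32, u32)], fan_out: usize) -> Vec<(u32, u32)> {
    let mut out = Vec::new();
    for (i, &(t1, f1)) in peaks.iter().enumerate() {
        for &(t2, f2) in peaks[i + 1..].iter().take(fan_out) {
            out.push((pack_hash(f1, f2, t2 - t1), t1));
        }
    }
    out
}
```

Because the hash stores only the time difference between two peaks, the same pair produces the same hash wherever it occurs in a recording. The anchor frame is kept alongside so a matcher can later check that hits line up in time.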

📦 Installation

Prerequisites

Ensure you have the following installed:

Rust (1.70 or higher) - Install Rust
Node.js (18 or higher) - Install Node.js
FFmpeg - Audio processing library
SQLite - Database engine

Install FFmpeg

macOS:

brew install ffmpeg

Ubuntu/Debian:

sudo apt update
sudo apt install ffmpeg

Windows: Download from the official FFmpeg website

Clone and Setup

# Clone the repository
git clone https://github.com/yourusername/acousti-scan-rs.git
cd acousti-scan-rs

# Build the Rust backend
cargo build --release

# Setup the frontend
cd client
npm install
# or
pnpm install

Database Setup

The application will automatically create the SQLite database on first run:

# Create necessary directories
mkdir -p songs tmp

# Start the API server, which creates the database automatically
cargo run --release api-server

🚀 Usage

Starting the Application

Start the Backend API Server

# Start the Rust API server (default: localhost:8080)
cargo run --release api-server

# Or specify custom host and port
cargo run --release api-server 0.0.0.0 3001
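For readers curious how the optional host and port might be resolved, here is a small sketch. Only the `localhost:8080` default comes from this README; the function name `bind_address`, the error message, and the assumption that the arguments are positional are hypothetical.

```rust
/// Resolves the bind address from the arguments that follow `api-server`,
/// falling back to localhost:8080 when either one is missing.
fn bind_address(args: &[String]) -> Result<(String, u16), String> {
    let host = args
        .first()
        .cloned()
        .unwrap_or_else(|| "localhost".to_string());
    let port = match args.get(1) {
        // Reject anything that is not a valid 16-bit port number.
        Some(p) => p
            .parse::<u16>()
            .map_err(|e| format!("invalid port {p:?}: {e}"))?,
        None => 8080,
    };
    Ok((host, port))
}
```

Binding to `0.0.0.0` exposes the server on every network interface, so the custom form is mainly useful when the API has to be reachable from other machines.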

Start the Frontend Development Server

cd client
npm run dev
# or
pnpm dev

The web interface will be available at http://localhost:3000

Command Line Interface

The application provides several CLI commands for direct interaction:

Identify a Song

# Identify a song from an audio file
cargo run --release find path/to/audio.wav

Add Songs to Database

# Add a single song (with automatic YouTube ID lookup)
cargo run --release save path/to/song.mp3

# Add a song without YouTube ID requirement
cargo run --release save -f path/to/song.mp3

# Add all songs in a directory
cargo run --release save path/to/music/directory/

Download from Spotify

# Download and add a song from Spotify URL
cargo run --release download "https://open.spotify.com/track/4uLU6hMCjMI75M1A2tKUQC"

Database Management

# Clear the entire database
cargo run --release erase

Web Interface Usage

Song Identification

Navigate to the Scan page
Upload an audio file or record directly in the browser
View identification results with confidence scores and YouTube links

Library Management

Access the Library tab to view all stored songs
Search and filter through your music collection
Play preview clips and access YouTube links

Contributing Songs

Use the Contribute tab to add new songs
Upload audio files or provide Spotify URLs
System automatically extracts metadata and creates fingerprints

Audio Processing Pipeline

The system follows this processing pipeline:

Audio Input → File upload or microphone recording
Format Conversion → Convert to WAV using FFmpeg
Spectrogram Generation → Create time-frequency representation
Peak Detection → Identify prominent frequency peaks
Fingerprint Creation → Generate hash-based signatures
Database Storage/Matching → Store or compare against existing fingerprints
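The final matching step can be sketched as a vote over time offsets: when a query clip really comes from a stored song, many of its hashes hit that song at the same offset between database time and query time. The `Index` type, the `match_query` name, and the scoring rule below are assumptions for illustration; the project's own scoring may differ.

```rust
use std::collections::HashMap;

/// Index from hash to every (song id, frame offset) where it occurs.
type Index = HashMap<u32, Vec<(u32, u32)>>;

/// Scores each song by the largest number of query hashes that agree on a
/// single time offset (database time minus query time), best score first.
/// `query` holds (hash, frame offset) entries for the unknown clip.
fn match_query(index: &Index, query: &[(u32, u32)]) -> Vec<(u32, usize)> {
    // Count votes per (song, offset) pair.
    let mut votes: HashMap<(u32, i64), usize> = HashMap::new();
    for &(hash, q_time) in query {
        if let Some(hits) = index.get(&hash) {
            for &(song_id, db_time) in hits {
                let offset = db_time as i64 - q_time as i64;
                *votes.entry((song_id, offset)).or_insert(0) += 1;
            }
        }
    }
    // Keep each song's best-supported offset.
    let mut best: HashMap<u32, usize> = HashMap::new();
    for ((song_id, _), count) in votes {
        let entry = best.entry(song_id).or_insert(0);
        *entry = (*entry).max(count);
    }
    let mut ranked: Vec<(u32, usize)> = best.into_iter().collect();
    // Highest score first; ties broken by song id for a stable order.
    ranked.sort_by(|a, b| b.1.cmp(&a.1).then(a.0.cmp(&b.0)));
    ranked
}
```

Random hash collisions scatter across many offsets, while a true match piles up on one, which is why counting aligned hits separates real matches from noise.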

📡 API Reference

The system provides a comprehensive REST API for programmatic access:

Base URL

http://localhost:8080/api

Endpoints

Identify Song

POST /api/find
Content-Type: multipart/form-data

# Form data:
file: [audio file binary]

Response:

[
  {
    "song_id": 123,
    "song_title": "Bohemian Rhapsody",
    "song_artist": "Queen",
    "youtube_id": "fJ9rUzIMcZQ",
    "timestamp": 45000,
    "score": 892.5
  }
]
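A client may want to reduce this array to a single answer. The sketch below mirrors the fields shown in the example response and picks the entry with the highest score. The struct name, the field types, and the assumption that a higher `score` means a better match are not confirmed by this README.

```rust
/// Mirrors the fields of one entry in the example `/api/find` response.
/// Field types are assumptions inferred from the sample values.
#[derive(Debug, Clone, PartialEq)]
struct MatchResult {
    song_id: u32,
    song_title: String,
    song_artist: String,
    youtube_id: String,
    timestamp: u32,
    score: f64,
}

/// Returns the highest-scoring match, or None for an empty response.
fn best_match(results: &[MatchResult]) -> Option<&MatchResult> {
    results.iter().max_by(|a, b| a.score.total_cmp(&b.score))
}
```

An empty array is a valid outcome for an unknown clip, so callers should handle the `None` case rather than indexing the first element.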

Get Library

GET /api/library

Response:

[
  {
    "id": 1,
    "name": "Bohemian Rhapsody",
    "artist": "Queen",
    "youtube_id": "fJ9rUzIMcZQ",
    "thumbnail_url": "https://img.youtube.com/vi/fJ9rUzIMcZQ/maxresdefault.jpg"
  }
]
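In the sample above, `thumbnail_url` follows the public YouTube thumbnail pattern built from `youtube_id`. A client could derive both the thumbnail and a watch link the same way, as in the sketch below. Both helper names are hypothetical, and whether the server builds its URLs like this is an assumption.

```rust
/// Builds the max-resolution thumbnail URL for a YouTube video id,
/// matching the pattern shown in the sample library response.
fn thumbnail_url(youtube_id: &str) -> String {
    format!("https://img.youtube.com/vi/{youtube_id}/maxresdefault.jpg")
}

/// Builds a standard YouTube watch link for the same id.
fn watch_url(youtube_id: &str) -> String {
    format!("https://www.youtube.com/watch?v={youtube_id}")
}
```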

Save Song

POST /api/save?force=true
Content-Type: multipart/form-data

# Form data:
file: [audio file binary]

Download from Spotify

POST /api/download
Content-Type: application/json

{
  "spotify_url": "https://open.spotify.com/track/4uLU6hMCjMI75M1A2tKUQC"
}

Clear Database

DELETE /api/erase

API Examples

JavaScript/TypeScript

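// Identify a song from a File or Blob (for example, from a file input)
async function identifySong(audioFile) {
  const formData = new FormData();
  formData.append('file', audioFile);

  const response = await fetch('http://localhost:8080/api/find', {
    method: 'POST',
    body: formData
  });
  // Surface HTTP errors instead of trying to parse an error body as JSON
  if (!response.ok) {
    throw new Error(`Identify request failed: ${response.status}`);
  }

  return response.json();
}

// Get library
async function getLibrary() {
  const response = await fetch('http://localhost:8080/api/library');
  if (!response.ok) {
    throw new Error(`Library request failed: ${response.status}`);
  }
  return response.json();
}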

Python

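import requests

def identify_song(file_path):
    """Upload an audio file and return the list of matches."""
    with open(file_path, 'rb') as f:
        files = {'file': f}
        response = requests.post('http://localhost:8080/api/find', files=files)
    # Raise on 4xx/5xx responses instead of parsing an error body
    response.raise_for_status()
    return response.json()

def get_library():
    """Return every song stored in the library."""
    response = requests.get('http://localhost:8080/api/library')
    response.raise_for_status()
    return response.json()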

Rust

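// Assumes reqwest is built with the "multipart", "stream", and "json"
// features, tokio with "fs", and that serde_json is a dependency.
use reqwest::multipart;

async fn identify_song(file_path: &str) -> Result<serde_json::Value, Box<dyn std::error::Error>> {
    // Stream the file from disk rather than loading it into memory
    let file = tokio::fs::File::open(file_path).await?;
    let file_part = multipart::Part::stream(file).file_name("audio.mp3");
    let form = multipart::Form::new().part("file", file_part);

    let client = reqwest::Client::new();
    let response = client
        .post("http://localhost:8080/api/find")
        .multipart(form)
        .send()
        .await?;

    // Parse the JSON array of matches
    Ok(response.json().await?)
}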

For complete API documentation, visit the web interface at /api-docs or check out the API Documentation page.

🏗️ Project Architecture

Directory Structure

acousti-scan-rs/
├── src/                          # Rust backend source code
│   ├── api.rs                   # REST API endpoints and server
│   ├── command_handlers.rs      # CLI command implementations
│   ├── db/                      # Database operations and models
│   ├── download/                # Spotify/YouTube integration
│   ├── models.rs                # Data structures and types
│   ├── shazam/                  # Audio fingerprinting algorithms
│   ├── utils/                   # Helper functions and utilities
│   ├── wav/                     # Audio processing and conversion
│   └── main.rs                  # Application entry point
├── client/                      # Next.js frontend application
│   ├── app/                     # Next.js 14 app router pages
│   ├── components/              # React components
│   ├── lib/                     # Utility functions and API client
│   └── styles/                  # CSS and styling files
├── songs/                       # Processed audio files storage
├── tmp/                         # Temporary file processing
├── Cargo.toml                   # Rust dependencies and metadata
└── README.md                    # This file

Core Components

Backend Architecture

main.rs - CLI argument parsing and application entry point
api.rs - Actix Web server with CORS and multipart file handling
command_handlers.rs - Business logic for CLI operations
shazam/ - Core audio fingerprinting algorithms
db/ - SQLite database operations and schema
wav/ - Audio file processing and FFmpeg integration
download/ - External service integrations

Frontend Architecture

app/ - Next.js pages using the app router
components/ - Reusable React components
lib/api.ts - HTTP client for backend communication
contexts/ - React context providers

Data Flow

Audio Input → FFmpeg Conversion → Spectrogram Analysis → Peak Detection → Fingerprint Generation → Database Storage/Matching → Results Display

🧪 Development

Running Tests

# Run Rust backend tests
cargo test

# Run frontend tests
cd client
npm test

Development Mode

# Backend with auto-reload
cargo install cargo-watch
cargo watch -x "run --release api-server"

# Frontend with hot reload
cd client
npm run dev

Building for Production

# Build optimized Rust binary
cargo build --release

# Build optimized frontend
cd client
npm run build
npm start

Environment Variables

Create .env files for configuration:

Backend (.env):

DATABASE_URL=./db.sqlite3
RUST_LOG=info
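A minimal sketch of how the backend might read these two variables, falling back to the values shown above when they are unset. `Config`, `config_from`, and `load_config` are hypothetical names. Note that `std::env` only sees variables already present in the process environment, so loading the `.env` file itself would need an extra step such as a dotenv-style crate.

```rust
use std::env;

/// Backend settings taken from the environment.
struct Config {
    database_url: String,
    log_level: String,
}

/// Builds a Config from any key lookup, which keeps the defaults testable.
fn config_from(lookup: impl Fn(&str) -> Option<String>) -> Config {
    Config {
        database_url: lookup("DATABASE_URL").unwrap_or_else(|| "./db.sqlite3".to_string()),
        log_level: lookup("RUST_LOG").unwrap_or_else(|| "info".to_string()),
    }
}

/// Reads the real process environment.
fn load_config() -> Config {
    config_from(|key| env::var(key).ok())
}
```

Passing the lookup as a closure means tests can supply fixed values without mutating the process environment.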

Frontend (client/.env.local):

NEXT_PUBLIC_API_URL=http://localhost:8080

Database Schema

The SQLite database contains the following main tables:

songs - Song metadata (title, artist, YouTube ID)
fingerprints - Audio fingerprint hashes and timing data
peaks - Spectral peaks for debugging and analysis

🤝 Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a feature branch

git checkout -b feature/amazing-feature

Make your changes
Add tests for new functionality
Commit your changes

git commit -m 'Add amazing feature'

Push to the branch

git push origin feature/amazing-feature

Open a Pull Request

Development Guidelines

Follow Rust conventions and use cargo fmt
Write tests for new features
Update documentation for API changes
Use TypeScript for frontend contributions
Follow the existing code style

Issues and Feature Requests

🐛 Bug Reports: Use the bug report template
💡 Feature Requests: Use the feature request template
📚 Documentation: Help improve our docs
🎨 UI/UX: Enhance the user experience

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

MIT License

Copyright (c) 2024 Acousti-Scan RS Contributors

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

🎯 Roadmap

Upcoming Features

Multi-language Support - Internationalization
Cloud Deployment - Docker containers and cloud setup
Advanced Analytics - Detailed usage statistics and insights
Playlist Management - Create and manage song collections
Social Features - Share discoveries and collaborate

Long-term Goals

Machine Learning Integration - Improve identification accuracy
Mobile Apps - Native iOS and Android applications
Plugin System - Extensible architecture for custom analyzers
Real-time Streaming - Live radio and streaming service integration
Enterprise Features - Advanced deployment and management tools

📞 Contact & Support

Project Maintainers

Primary Maintainer: Ankesh Gupta
GitHub: @Ankesh2004

Community

🐛 Bug Reports: GitHub Issues
💬 Discussions: GitHub Discussions
📧 Email: support@acousti-scan.com
💬 Discord: Join our Discord Server

Documentation

📖 API Docs: Available at /api-docs when running the application
🎓 Tutorials: Check out our Wiki
📚 Examples: See the examples/ directory

Made with ❤️ and 🦀 Rust

⭐ Star this repo • 🐛 Report Bug • 💡 Request Feature
