TuneTracer's Algorithm

1. WAV Loading & Preprocessing

Raw PCM extraction: We parse the RIFF/WAVE header ourselves (no external libraries) to load 16 bit PCM samples.
Spectrogram (STFT): Audio is framed into 2048-sample windows (≈46 ms) with 50% overlap, a Hann window is applied, and a radix-2 FFT is run to produce a high-resolution magnitude spectrogram (S[t,f]).

2. Peak Detection (“Constellation”)

Local maxima: For each time frame and frequency bin, we detect peaks that exceed a dynamic threshold (mean + 1.5 σ) and are the highest in a 7×7 neighborhood.
Node list: Each peak (p_i = (t_i, f_i, A_i)) becomes a node in our fingerprint graph.

3. Graph-Based Descriptors

Neighbor graph: For each peak (p_i), we connect it to up to K = 5 future peaks (p_j) within ±Δf (e.g. ±50 bins) and Δt (up to 1 s).
Amplitude weighting: Each edge is weighted by (A_i \times A_j), so louder events carry more significance.
Local descriptor: We record ((\Delta t, \Delta f, w)) for each of the K strongest edges, forming a 3×K vector around each anchor.

4. Quantization & Hashing

Bins: (\Delta t) quantized into 16 time-bins, (\Delta f) into 32 freq-bins, and (w) into 8 amplitude-bins.
FNV-1a hashing: The 3K bytes are concatenated and hashed into a 64-bit fingerprint code, compactly encoding subgraph structure.

5. Persistent Indexing

All (code, songId, timestamp) tuples are bulk‐inserted into SQLite with an index on code.
This on-disk database scales to millions of fingerprints and requires no external key-value service.

6. Query & Matching

Fingerprint snippet: The same STFT → peaks → graph → hashes pipeline runs on the query.
Lookup: Each code is looked up in SQLite, returning (songId, dbTimestamp) entries.
Voting: We tally votes for each (songId, Δt = queryTime – dbTime) pair.
Best match: The song and time-offset with the highest vote count wins.

Why Tune Tracer Beats Classic Shazam

Richer descriptors: Captures a small subgraph of K edges per anchor vs. a single peak pair.
Amplitude-aware: Weighs edges by peak magnitudes for robustness to noise.
Finer quantization: More granular time/frequency bins for higher discrimination.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
CoreApp		CoreApp
WebApp		WebApp
audio		audio
.gitignore		.gitignore
LICENSE		LICENSE
Linkin Park_ New Divide [Live 4K] (Paris, France - November 3, 2024).mp3		Linkin Park_ New Divide [Live 4K] (Paris, France - November 3, 2024).mp3
New Divide- Linkin Park.mp3		New Divide- Linkin Park.mp3
README.md		README.md
TuneTracer.sln		TuneTracer.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TuneTracer's Algorithm

1. WAV Loading & Preprocessing

2. Peak Detection (“Constellation”)

3. Graph-Based Descriptors

4. Quantization & Hashing

5. Persistent Indexing

6. Query & Matching

Why Tune Tracer Beats Classic Shazam

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TuneTracer's Algorithm

1. WAV Loading & Preprocessing

2. Peak Detection (“Constellation”)

3. Graph-Based Descriptors

4. Quantization & Hashing

5. Persistent Indexing

6. Query & Matching

Why Tune Tracer Beats Classic Shazam

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages