echo-sense

CLI-first audio fingerprinting experiment (Shazam-style) written in Go.

output.mp4

Given a Spotify track URL, echo-sense:

Fetches track metadata from Spotify
Finds a matching YouTube video
Downloads audio via yt-dlp (saved as .m4a)
Converts to WAV via ffmpeg
Builds a spectrogram, extracts peaks, generates fingerprints
Stores songs + fingerprints in Postgres (via GORM)

You can then run a local search against a downloaded track file.

Status

This repo is actively evolving. The download + fingerprint generation path is implemented; match scoring / “print the best match” flow is still a work in progress (matching currently computes intermediate structures but does not output a final match result).

Project Layout

server/: Go module (the CLI lives here)
server/db/: Postgres connection + GORM models
server/internals/recognisingAlgorithm/: spectrogram + peak picking + fingerprinting + matching logic
server/internals/spotify/: Spotify metadata + YouTube lookup + audio download pipeline
songs/: created at runtime; downloaded .m4a files end up here
temporary_files/: created at runtime

Prerequisites

Go (see server/go.mod for the configured version)
Postgres (local or remote)
yt-dlp available on your PATH
ffmpeg available on your PATH

Setup

Install dependencies

macOS (brew):
- brew install ffmpeg postgresql@16 yt-dlp
Linux (example):
- sudo apt-get install -y ffmpeg postgresql
- python3 -m pip install -U yt-dlp

Create server/.env

echo-sense loads environment variables from a .env file at startup (server/main.go).

Example:

# Postgres (note: current code does not use DB_PASSWORD in the DSN)
DB_HOST=localhost
DB_PORT=5432
DB_USER=postgres
DB_NAME=echo_sense

# Spotify “Client Credentials” flow
SPOTIFY_CLIENT_ID=...
SPOTIFY_CLIENT_SECRET=...

Spotify credentials:

Create an app in the Spotify Developer Dashboard.
Copy the Client ID + Client Secret into server/.env.

Start Postgres

Ensure the user in DB_USER can connect without a password (the current DSN in server/db/db.go does not include DB_PASSWORD).

On startup, the app will:

Connect to Postgres
Create DB_NAME if it doesn’t exist
Auto-migrate the songs and audio_fingerprints tables

Usage

Run everything from server/:

cd server
go run . download "<spotify_track_url>"

This creates (if missing) songs/ and downloads a file named like:

songs/<Artist> - <Title>.m4a

It also fingerprints the track and stores results in Postgres.

Search (local file)

search looks for a file in songs/ by the base filename (without extension):

cd server
go run . search "Artist - Title"

This converts the .m4a to WAV, generates fingerprints from the audio, and runs the matching pipeline against fingerprints stored in the database.

Notes & Troubleshooting

If you see ffmpeg error: exec: "ffmpeg": executable file not found in $PATH, install ffmpeg and ensure it’s on your PATH.
If you see exec: "yt-dlp": executable file not found in $PATH, install yt-dlp and ensure it’s on your PATH.
If Postgres connection fails, double-check DB_HOST, DB_PORT, and that DB_USER can connect to Postgres without a password (or update server/db/db.go to include a password in the DSN).
Spotify auth uses a cached token file spotify_token.json in the working directory; don’t commit it.

How It Works (High Level)

Convert audio to a normalized WAV format (server/internals/wavService/wav.go)
Compute a spectrogram (server/internals/recognisingAlgorithm/spectrogram.go)
Extract salient peaks (server/internals/recognisingAlgorithm/*)
Turn peak pairs into compact fingerprints and store them (server/internals/recognisingAlgorithm/fingerprinting.go, server/db/models/*)
For a query clip, generate fingerprints and look up candidate matches in Postgres (server/internals/recognisingAlgorithm/matchTracks.go)

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
frontend		frontend
server		server
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

echo-sense

Status

Project Layout

Prerequisites

Setup

Usage

Search (local file)

Notes & Troubleshooting

How It Works (High Level)

About

Uh oh!

Releases

Packages

Languages

Pritam-deb/echo-sense

Folders and files

Latest commit

History

Repository files navigation

echo-sense

Status

Project Layout

Prerequisites

Setup

Usage

Search (local file)

Notes & Troubleshooting

How It Works (High Level)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages