Sussurro

Sussurro is a fully local, open-source, cross-platform CLI voice-to-text system that acts as a system-wide AI dictation layer. It transforms speech into clean, formatted, context-aware text injected into any application.

New to Sussurro? Start with the Quick Start Guide to get running in under 5 minutes.

Overview

Sussurro uses local AI models to ensure privacy and low latency. It combines:

Whisper.cpp for automatic speech recognition (ASR).
LLMs (Qwen 3 Sussurro) for intelligent text cleanup, removing filler words, and fixing grammar errors.

Features

CLI-First: Lightweight command-line interface controlled by configuration files.
Smart Cleanup:
- Filler Removal: Automatically removes "umm", "ah", "like".
- Self-Correction: Handles speech repairs (e.g., "I want blue... no red" -> "I want red").
- Guardrails: Algorithmic checks to ensure accurate transcription and prevent hallucinations.
Local Processing: No data leaves your machine.
System-Wide: Works in any application where you can type.
Configurable: Load custom configs at runtime.
Cross-Platform: Designed for macOS, Windows and Linux.
Flexible ASR: Choose between Whisper Small (fast) or Whisper Large v3 Turbo (accurate) at setup or later.

Documentation

Quick Start: Get up and running in under 5 minutes (recommended for first-time users).
Dependencies: System requirements and package installation for your platform.
Wayland Setup: Setup guide for Wayland users (one-time configuration required).
Configuration: Detailed guide on default.yaml and environment variables.
Architecture: Learn how the audio pipeline, ASR, and LLM engines work together.
Compilation: Instructions for building from source.

Getting Started

Choose your platform below for specific instructions:

macOS

Using Prebuilt Binary:

Download sussurro-macos-<arch>.tar.gz from the GitHub Releases page

Extract and prepare:

tar -xzf sussurro-macos-*.tar.gz
cd sussurro-macos-<arch>
chmod +x sussurro trigger.sh
xattr -d com.apple.quarantine sussurro  # Remove macOS quarantine

Run:
```
./sussurro
```
On first run, Sussurro will guide you through model download

Usage: Hold Cmd+Shift+Space to talk, release to transcribe. Works immediately, no configuration needed.

Building from Source: See Compilation Guide

Linux (Arch/Manjaro)

Step 1: Install Dependencies

# For Wayland users (GNOME Wayland, KDE Wayland, Sway, Hyprland)
sudo pacman -S wl-clipboard

# For X11 users (optional, for window context detection)
sudo pacman -S xdotool xorg-xprop

Step 2: Get Sussurro

Option A: Prebuilt Binary

Download sussurro-linux-<arch>.tar.gz from GitHub Releases
Extract and prepare:
```
tar -xzf sussurro-linux-*.tar.gz
cd sussurro-linux-<arch>
chmod +x sussurro trigger.sh
```
Package includes: sussurro binary, trigger.sh script (for Wayland), example config
Run:
```
./sussurro
```

Option B: Build from Source

git clone https://github.com/cesp99/sussurro.git
cd sussurro
make build
./bin/sussurro

Step 3: First Run Setup

Sussurro will create ~/.sussurro/config.yaml
Follow prompts to download AI models

Step 4: Configure Keyboard Shortcut (Wayland Only)

If using Wayland (check with echo $XDG_SESSION_TYPE):

GNOME:

Settings → Keyboard → Keyboard Shortcuts → Custom Shortcuts
Click "+" to add new
Name: "Sussurro Voice Input"
Command: /full/path/to/extracted/folder/trigger.sh
Set shortcut: Ctrl+Shift+Space

KDE Plasma:

System Settings → Shortcuts → Custom Shortcuts
Right-click → New → Global Shortcut → Command/URL
Trigger tab: Set Ctrl+Shift+Space
Action tab: /full/path/to/extracted/folder/trigger.sh

Sway/Hyprland: See Wayland Setup Guide

Usage:

X11: Hold Ctrl+Shift+Space, talk, release (works immediately)
Wayland: Press once to start, talk, press again to stop

Linux (Ubuntu/Debian)

Step 1: Install Dependencies

# For Wayland users
sudo apt install wl-clipboard

# For X11 users (optional)
sudo apt install xdotool x11-utils

Step 2-4: Same as Arch instructions above

Linux (Other Distributions)

See dependencies.md for your distribution's package manager commands, then follow the Arch instructions above.

Windows

Status: Not yet tested. Contributions welcome.

Quick Reference

Platform	Hotkey Behavior	Setup Required
macOS	Hold to talk	None
Linux X11	Hold to talk	None
Linux Wayland	Toggle (press twice)	One-time DE shortcut

Troubleshooting: See dependencies.md

Switching Whisper Models

You can switch the Whisper ASR model at any time using --whisper (or --wsp):

./sussurro --whisper
# or
./sussurro --wsp

This opens an interactive menu to switch between:

Model	Size	Best for
Whisper Small	488 MB	Faster transcription, lower RAM usage
Whisper Large v3 Turbo	1.62 GB	Higher accuracy, slower

If the selected model isn't already downloaded, Sussurro will offer to download it. The model choice is also prompted during first-run setup.

License

This project is licensed under the GNU General Public License v3.0 (GPLv3) - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
cmd/sussurro		cmd/sussurro
configs		configs
docs		docs
internal		internal
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sussurro

Overview

Features

Documentation

Getting Started

macOS

Linux (Arch/Manjaro)

Linux (Ubuntu/Debian)

Linux (Other Distributions)

Windows

Quick Reference

Switching Whisper Models

License

About

Uh oh!

Releases 5

Packages

Languages

License

cesp99/sussurro

Folders and files

Latest commit

History

Repository files navigation

Sussurro

Overview

Features

Documentation

Getting Started

macOS

Linux (Arch/Manjaro)

Linux (Ubuntu/Debian)

Linux (Other Distributions)

Windows

Quick Reference

Switching Whisper Models

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Languages

Packages