Sussurro is a fully local, open-source, cross-platform CLI voice-to-text system that acts as a system-wide AI dictation layer. It transforms speech into clean, formatted, context-aware text injected into any application.
New to Sussurro? Start with the Quick Start Guide to get running in under 5 minutes.
Sussurro uses local AI models to ensure privacy and low latency. It combines:
- Whisper.cpp for automatic speech recognition (ASR).
- LLMs (Qwen 3 Sussurro) for intelligent text cleanup, removing filler words, and fixing grammar errors.
- CLI-First: Lightweight command-line interface controlled by configuration files.
- Smart Cleanup:
- Filler Removal: Automatically removes "umm", "ah", "like".
- Self-Correction: Handles speech repairs (e.g., "I want blue... no red" -> "I want red").
- Guardrails: Algorithmic checks to ensure accurate transcription and prevent hallucinations.
- Local Processing: No data leaves your machine.
- System-Wide: Works in any application where you can type.
- Configurable: Load custom configs at runtime.
- Cross-Platform: Designed for macOS, Windows and Linux.
- Flexible ASR: Choose between Whisper Small (fast) or Whisper Large v3 Turbo (accurate) at setup or later.
- Quick Start: Get up and running in under 5 minutes (recommended for first-time users).
- Dependencies: System requirements and package installation for your platform.
- Wayland Setup: Setup guide for Wayland users (one-time configuration required).
- Configuration: Detailed guide on
default.yamland environment variables. - Architecture: Learn how the audio pipeline, ASR, and LLM engines work together.
- Compilation: Instructions for building from source.
Choose your platform below for specific instructions:
Using Prebuilt Binary:
- Download
sussurro-macos-<arch>.tar.gzfrom the GitHub Releases page - Extract and prepare:
tar -xzf sussurro-macos-*.tar.gz cd sussurro-macos-<arch> chmod +x sussurro trigger.sh xattr -d com.apple.quarantine sussurro # Remove macOS quarantine
- Run:
On first run, Sussurro will guide you through model download
./sussurro
Usage: Hold Cmd+Shift+Space to talk, release to transcribe. Works immediately, no configuration needed.
Building from Source: See Compilation Guide
Step 1: Install Dependencies
# For Wayland users (GNOME Wayland, KDE Wayland, Sway, Hyprland)
sudo pacman -S wl-clipboard
# For X11 users (optional, for window context detection)
sudo pacman -S xdotool xorg-xpropStep 2: Get Sussurro
Option A: Prebuilt Binary
- Download
sussurro-linux-<arch>.tar.gzfrom GitHub Releases - Extract and prepare:
Package includes:
tar -xzf sussurro-linux-*.tar.gz cd sussurro-linux-<arch> chmod +x sussurro trigger.sh
sussurrobinary,trigger.shscript (for Wayland), example config - Run:
./sussurro
Option B: Build from Source
git clone https://github.com/cesp99/sussurro.git
cd sussurro
make build
./bin/sussurroStep 3: First Run Setup
- Sussurro will create
~/.sussurro/config.yaml - Follow prompts to download AI models
Step 4: Configure Keyboard Shortcut (Wayland Only)
If using Wayland (check with echo $XDG_SESSION_TYPE):
GNOME:
- Settings → Keyboard → Keyboard Shortcuts → Custom Shortcuts
- Click "+" to add new
- Name: "Sussurro Voice Input"
- Command:
/full/path/to/extracted/folder/trigger.sh - Set shortcut:
Ctrl+Shift+Space
KDE Plasma:
- System Settings → Shortcuts → Custom Shortcuts
- Right-click → New → Global Shortcut → Command/URL
- Trigger tab: Set
Ctrl+Shift+Space - Action tab:
/full/path/to/extracted/folder/trigger.sh
Sway/Hyprland: See Wayland Setup Guide
Usage:
- X11: Hold
Ctrl+Shift+Space, talk, release (works immediately) - Wayland: Press once to start, talk, press again to stop
Step 1: Install Dependencies
# For Wayland users
sudo apt install wl-clipboard
# For X11 users (optional)
sudo apt install xdotool x11-utilsStep 2-4: Same as Arch instructions above
See dependencies.md for your distribution's package manager commands, then follow the Arch instructions above.
Status: Not yet tested. Contributions welcome.
| Platform | Hotkey Behavior | Setup Required |
|---|---|---|
| macOS | Hold to talk | None |
| Linux X11 | Hold to talk | None |
| Linux Wayland | Toggle (press twice) | One-time DE shortcut |
Troubleshooting: See dependencies.md
You can switch the Whisper ASR model at any time using --whisper (or --wsp):
./sussurro --whisper
# or
./sussurro --wspThis opens an interactive menu to switch between:
| Model | Size | Best for |
|---|---|---|
| Whisper Small | 488 MB | Faster transcription, lower RAM usage |
| Whisper Large v3 Turbo | 1.62 GB | Higher accuracy, slower |
If the selected model isn't already downloaded, Sussurro will offer to download it. The model choice is also prompted during first-run setup.
This project is licensed under the GNU General Public License v3.0 (GPLv3) - see the LICENSE file for details.