Voice-first personal automation for power users.
Speak a command. It transcribes, polishes, and routes — all locally on your Mac.
Quick Start · Features · How It Works · Download
Voice transcription tools turn speech into text — then stop. You still copy-paste into apps, switch windows, route output manually.
The open-source world has two mature, disconnected layers:
- Voice/STT: OpenWhispr, Whispering, VoiceInk
- Workflow automation: n8n, Activepieces, Huginn
Nobody built the bridge. WhisperWoof is that bridge.
|
1. Hold Fn Mando's ears perk up. You're recording. |
➔ |
2. Speak Say whatever you want. Filler words welcome. |
➔ |
3. Release Clean, polished text appears at your cursor. |
Voice ──▶ Local STT (Whisper/Parakeet)
│
▼
Local LLM Polish (Ollama)
Removes filler, fixes grammar
│
▼
Hotkey-driven routing
│
├──▶ Fn → Paste polished text at cursor
├──▶ Fn + T → Add to todo list
├──▶ Fn + N → Save as Markdown note
├──▶ Fn + C → Add to calendar
└──▶ All entries saved to searchable history
|
|
# Clone and run
git clone https://github.com/h3qing/whisperwoof.git
cd whisperwoof
npm install
npm startOr download the app directly: Latest .dmg release (Apple Silicon)
Optional — install Ollama for AI text polishing:
brew install ollama && ollama pull llama3.2:1b && ollama serve- macOS (Apple Silicon recommended)
- Microphone (built-in or external)
- Ollama (optional) — for local AI text polish. Install Ollama
| Principle | What it means |
|---|---|
| Hotkey = intent | The key combo you press determines where voice goes. Explicit over magic. |
| Local-first | Everything runs on your machine. No cloud. No data leaving your device. |
| Fork, don't reinvent | Built on OpenWhispr's proven STT engine and Electron shell. |
| Power users first | Control, customization, and ownership of your tools. |
| Layer | Technology |
|---|---|
| Runtime | Electron 39 + React 19 + TypeScript + Tailwind CSS v4 |
| STT | OpenAI Whisper / NVIDIA Parakeet (local) |
| LLM Polish | Ollama (local, optional — works without it) |
| Storage | SQLite + Kysely ORM + FTS5 full-text search |
| Plugins | Model Context Protocol (MCP) |
- Phase 0 — Fork + security hardening + test infrastructure
- Phase 1 — Core pipeline: StorageProvider, Ollama polish, hotkey routing, features
- Phase 2 — MCP plugin system (Todoist, Notion, Slack, Calendar)
- Phase 3 — Polish, onboarding, public release (v1.0)
- Phases 4–10 — Competitive features, AI intelligence, vibe coding, streaming, templates
- Meeting recording — Crash-safe audio buffer, transcript checkpoints, Granola-style detection
- Agent mode — Voice-driven AI chat with streaming LLM responses
- Distribution — Code signing, notarization, auto-update
WhisperWoof is a fork of OpenWhispr — we're grateful to the OpenWhispr team for building such a solid foundation.
Also built on: OpenAI Whisper · NVIDIA Parakeet · Ollama · Model Context Protocol
WhisperWoof is in early development. Contributions, feedback, and ideas are welcome — please open an issue to discuss before submitting a PR.
MIT — see LICENSE for details.
Named after Mando, who always listens.
Built with care by Heqing.