Mando — the WhisperWoof mascot

WhisperWoof

Voice-first personal automation for power users.
Speak a command. It transcribes, polishes, and routes — all locally on your Mac.

v1.9.0 · MIT License · macOS · 744 tests passing

Quick Start · Features · How It Works · Download



The Problem

Voice transcription tools turn speech into text — then stop. You still copy-paste into apps, switch windows, and route output by hand.

The open-source world has two mature, disconnected layers:

  • Voice/STT: OpenWhispr, Whispering, VoiceInk
  • Workflow automation: n8n, Activepieces, Huginn

Nobody built the bridge. WhisperWoof is that bridge.


How It Works

1. Hold Fn. Mando's ears perk up. You're recording.
2. Speak. Say whatever you want. Filler words welcome.
3. Release. Clean, polished text appears at your cursor.
Voice ──▶ Local STT (Whisper/Parakeet)
              │
              ▼
         Local LLM Polish (Ollama)
         Removes filler, fixes grammar
              │
              ▼
         Hotkey-driven routing
              │
              ├──▶ Fn         → Paste polished text at cursor
              ├──▶ Fn + T     → Add to todo list
              ├──▶ Fn + N     → Save as Markdown note
              ├──▶ Fn + C     → Add to calendar
              └──▶ All entries saved to searchable history
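The routing step above amounts to a plain lookup table from key combo to destination. A minimal sketch in TypeScript — the names and shapes here are illustrative, not WhisperWoof's actual internals:

```typescript
// Hotkey-driven routing sketch: each combo maps to one explicit destination,
// and every entry also lands in the searchable history.
type Destination = "paste" | "todo" | "note" | "calendar";

const routes: Record<string, Destination> = {
  "Fn": "paste",      // paste polished text at cursor
  "Fn+T": "todo",     // add to todo list
  "Fn+N": "note",     // save as Markdown note
  "Fn+C": "calendar", // add to calendar
};

const history: { dest: Destination; text: string }[] = [];

function route(combo: string, polishedText: string): Destination {
  const dest = routes[combo];
  if (!dest) throw new Error(`unknown hotkey combo: ${combo}`);
  history.push({ dest, text: polishedText }); // all entries are recorded
  return dest;
}
```

Keeping the mapping explicit is what "hotkey = intent" means in practice: no classifier guesses where your words should go.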

Features

Core Pipeline

  • Local voice-to-text — Whisper STT on your machine. No cloud, no latency, no data leaving your laptop.
  • AI text polish — Ollama removes filler, fixes grammar, adds punctuation. 5 presets or write your own.
  • Hotkey-driven routing — Different combos send voice to different destinations. Explicit, not magic.

Capture & History

  • Unified clipboard + voice history — Everything you say or copy, searchable. Images too.
  • Audio playback — Tap any entry to replay the original recording.
  • Full-text search — SQLite FTS5 across all your voice and clipboard entries.
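An FTS5 search boils down to a MATCH query against a virtual table. A sketch of preparing one safely — the table and column names (`history_fts`, `content`) are assumptions for illustration, not taken from the codebase:

```typescript
// Quote each term so user punctuation isn't parsed as FTS5 query syntax.
function toFtsQuery(input: string): string {
  return input
    .trim()
    .split(/\s+/)
    .filter(Boolean)
    .map((term) => `"${term.replace(/"/g, '""')}"`)
    .join(" ");
}

// Build a parameterized FTS5 search, ranked by relevance.
function buildSearchSql(input: string): { sql: string; params: string[] } {
  return {
    sql: `SELECT rowid, content FROM history_fts WHERE history_fts MATCH ? ORDER BY rank`,
    params: [toFtsQuery(input)],
  };
}
```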

Intelligence

  • Context-aware — Detects active app. VS Code gets code style, Slack gets casual, Mail gets professional.
  • Voice commands — "Rewrite this." "Translate to Spanish." "Summarize." 10 editing commands.
  • Cmd+K command bar — Spotlight-style overlay. Type /todo, /note, /project.
  • Agent mode — Voice-driven AI chat. Press hotkey, speak, get streamed LLM responses.
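Context-awareness can be as simple as mapping the frontmost app's bundle identifier to a polish preset. A sketch — the bundle ids are real macOS identifiers, but the preset names and mapping are hypothetical:

```typescript
// Map the active app's bundle id to a text-polish preset.
type Preset = "code" | "casual" | "professional" | "default";

const presetByBundleId: Record<string, Preset> = {
  "com.microsoft.VSCode": "code",          // code style for editors
  "com.tinyspeck.slackmacgap": "casual",   // casual tone for chat
  "com.apple.mail": "professional",        // formal tone for email
};

function presetFor(bundleId: string): Preset {
  return presetByBundleId[bundleId] ?? "default";
}
```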

Meeting Recording (new)

  • Granola-style detection — Detects meetings via calendar + mic + process signals. Shows persistent notification.
  • Pre-meeting alerts — Notification appears ~90s before scheduled meetings.
  • Crash-safe audio — Audio saved to local WAV files in 5-minute segments. Never lose a meeting.
  • Transcript checkpoints — Saved to SQLite every 60s. Survives crashes and network drops.
  • Auto-reconnect — WebSocket reconnection with backoff + session rotation at 25 minutes.
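The crash-safety numbers above (5-minute segments, 25-minute session rotation, backoff on reconnect) reduce to a little arithmetic. A sketch, with illustrative constants:

```typescript
const BASE_DELAY_MS = 1_000;
const MAX_DELAY_MS = 30_000;
const SEGMENT_MS = 5 * 60 * 1_000;           // 5-minute WAV segments
const SESSION_ROTATION_MS = 25 * 60 * 1_000; // rotate session at 25 min

// Exponential backoff: attempt 0 → 1s, 1 → 2s, 2 → 4s … capped at 30s.
function backoffDelay(attempt: number): number {
  return Math.min(BASE_DELAY_MS * 2 ** attempt, MAX_DELAY_MS);
}

// Which 5-minute segment file an audio sample belongs to.
function segmentIndex(elapsedMs: number): number {
  return Math.floor(elapsedMs / SEGMENT_MS);
}

function shouldRotateSession(sessionStartMs: number, nowMs: number): boolean {
  return nowMs - sessionStartMs >= SESSION_ROTATION_MS;
}
```

Because each segment is a closed WAV file on disk, a crash loses at most the segment currently being written, never the whole meeting.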

Smart Clipboard

  • Kanban board — Organize reusable text snippets into boards (Greetings, Work, Code, etc.)
  • Quick paste — Copy any snippet with one click. Hotkey paste with Cmd+Shift+1-9.
  • Frequency tracking — See which snippets you use most. Sorted by usage count.
  • Source tracking — Know if a snippet was typed manually, AI-generated, or captured from voice.

Privacy & Design

  • Privacy lock — One toggle blocks ALL cloud access. Ollama-only, zero network.
  • MCP plugins — Route voice to Todoist, Notion, Slack. Any MCP server works as a plugin.
  • Mando's ears — The floating indicator has dog ears that perk up when you speak.

Quick Start

# Clone and run
git clone https://github.com/h3qing/whisperwoof.git
cd whisperwoof
npm install
npm start

Or download the app directly: Latest .dmg release (Apple Silicon)

Optional — install Ollama for AI text polishing:

brew install ollama && ollama pull llama3.2:1b && ollama serve
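With Ollama running, a polish pass is one request to its local /api/generate endpoint. A sketch of the request shape — the endpoint and fields follow Ollama's documented API, but the prompt wording is illustrative:

```typescript
// Build a local polish request for Ollama's /api/generate endpoint.
function buildPolishRequest(rawTranscript: string) {
  return {
    url: "http://localhost:11434/api/generate",
    body: {
      model: "llama3.2:1b",
      prompt: `Remove filler words and fix grammar. Return only the cleaned text:\n\n${rawTranscript}`,
      stream: false,
    },
  };
}

// Usage (requires a running Ollama server):
//   const { url, body } = buildPolishRequest("um so like, ship it tomorrow");
//   const res = await fetch(url, { method: "POST", body: JSON.stringify(body) });
//   const { response } = await res.json();
```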

Requirements

  • macOS (Apple Silicon recommended)
  • Microphone (built-in or external)
  • Ollama (optional) — for local AI text polish. Install Ollama

Design Principles

Principle             What it means
Hotkey = intent       The key combo you press determines where voice goes. Explicit over magic.
Local-first           Everything runs on your machine. No cloud. No data leaving your device.
Fork, don't reinvent  Built on OpenWhispr's proven STT engine and Electron shell.
Power users first     Control, customization, and ownership of your tools.

Tech Stack

Layer       Technology
Runtime     Electron 39 + React 19 + TypeScript + Tailwind CSS v4
STT         OpenAI Whisper / NVIDIA Parakeet (local)
LLM Polish  Ollama (local, optional — works without it)
Storage     SQLite + Kysely ORM + FTS5 full-text search
Plugins     Model Context Protocol (MCP)

Roadmap

  • Phase 0 — Fork + security hardening + test infrastructure
  • Phase 1 — Core pipeline: StorageProvider, Ollama polish, hotkey routing, core features
  • Phase 2 — MCP plugin system (Todoist, Notion, Slack, Calendar)
  • Phase 3 — Polish, onboarding, public release (v1.0)
  • Phases 4–10 — Competitive features, AI intelligence, vibe coding, streaming, templates
  • Meeting recording — Crash-safe audio buffer, transcript checkpoints, Granola-style detection
  • Agent mode — Voice-driven AI chat with streaming LLM responses
  • Distribution — Code signing, notarization, auto-update

Credits

WhisperWoof is a fork of OpenWhispr — we're grateful to the OpenWhispr team for building such a solid foundation.

Also built on: OpenAI Whisper · NVIDIA Parakeet · Ollama · Model Context Protocol


Contributing

WhisperWoof is in early development. Contributions, feedback, and ideas are welcome — please open an issue to discuss before submitting a PR.

License

MIT — see LICENSE for details.


Mando
Named after Mando, who always listens.
Built with care by Heqing.