Wispah Flow

Free and open source alternative to Wispr Flow, Superwhisper, and Monologue.
Press a hotkey, speak, and your words appear at the cursor - adapted to what's on screen.

⬇ Download Wispah.dmg
_{macOS 13+ · Apple Silicon + Intel}

Features

Privacy-first - no servers, no accounts, no telemetry. The only network calls are to your chosen provider's API. Audio is processed and discarded, nothing stored externally.
Bring your own API key - choose between Groq (free) and OpenAI, pick your transcription and post-processing models, switch providers anytime
Context-aware transcription - takes a screenshot when you start recording, then uses it to get names, terminology, and formatting right. Replying to an email? It'll spell the person's name correctly. Writing code? It'll match the syntax.
Customizable post-processing - everything is a toggle. Want raw transcription with no processing? Turn it all off. Want the full pipeline? Enable smart formatting (auto-detects lists, paragraphs), smart corrections (cleans up "wait no, I meant..." mid-speech), developer mode (recognizes code terms), and screen context - mix and match to fit how you work.
Two recording modes - hold-to-record (push-to-talk style) and toggle (press to start, press to stop), each with its own hotkey
Paste at cursor - transcription goes straight to wherever your cursor is, with smart leading-space detection so it doesn't smash into existing text
Live recording overlay - floating pill with waveform visualization, state transitions, and a smooth slide-to-notch animation
Auto-updates - checks GitHub Releases in the background with a 3-day stability buffer. Downloads the DMG, replaces the app, and relaunches - all with one click.
Pause media while recording - optionally pauses music/video during recording, resumes when done
Transcription history - searchable log of every transcription with audio playback
Usage stats - words transcribed, recording time, streaks, words per minute

Why I Built This

Honestly, I built this for myself. I tried Wispr Flow, other open source alternatives, and nothing had everything I wanted in one place. Context-aware formatting existed in some tools. Developer mode existed in others. But they were either too slow, unreliable, buggy, or missing that one feature I really wanted: auto-pausing music while recording. Sounds small, but it was the dealbreaker.

I forked FreeFlow by Zach Latta because it had the best UI/UX and was the most reliable of everything I tried. From there I added everything I was missing: full customizability, stats, optional screen context recording, custom hotkeys, sound toggles, failed transcription retry, post-processing with developer mode, cleanup to ensure the output is always good, and a bunch of other things - and of course, my beloved pause music on record.

I use it every day, so I'll keep improving it - but it'll always be free, open source, and yours to own. Check the roadmap, it's a fun one.

API Providers

Wispah Flow supports Groq and OpenAI as API providers. Pick one during setup, switch anytime in Settings. Both keys are saved — switching is instant.

Provider	Transcription	Post-Processing	Free Tier
Groq	Whisper Large V3 / Turbo	Llama 4 Scout / 3.3 70B	Yes — no credit card needed
OpenAI	GPT-4o Mini Transcribe / Transcribe / Whisper 1	GPT-5 Nano / Mini / 5, GPT-4.1 Nano / Mini / 4.1	No — pay-as-you-go

Why Groq is the default: It's free, fast (custom LPU hardware), and what we inherited from FreeFlow. For most users it's all you need.

Setup

Download from Releases
Get an API key — Groq (free) or OpenAI
Open the app and follow the setup wizard

The wizard walks you through picking a provider, granting permissions (microphone, accessibility, screen recording), and configuring your hotkeys.

Privacy

No servers, no accounts, no tracking. The only network calls are to your chosen provider's API for transcription and context processing. Audio is processed and discarded - nothing is stored or retained externally.

Roadmap

For Developers

See TECHNICAL_README.md for build instructions, architecture, and how to contribute. The project includes full Claude Code setup (CLAUDE.md + skills) for AI-assisted development.

Credits

Wispah Flow is a fork of FreeFlow by Zach Latta. Original project licensed under MIT.

License

MIT License. See LICENSE.

Third-party dependencies are listed in THIRD_PARTY_LICENSES.md.

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
.claude		.claude
.github/workflows		.github/workflows
Resources		Resources
Sources		Sources
.gitignore		.gitignore
Info.plist		Info.plist
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
TECHNICAL_README.md		TECHNICAL_README.md
THIRD_PARTY_LICENSES.md		THIRD_PARTY_LICENSES.md
Wispah.entitlements		Wispah.entitlements

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wispah Flow

Features

Why I Built This

API Providers

Setup

Privacy

Roadmap

For Developers

Credits

License

About

Uh oh!

Releases 14

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Wispah Flow

Features

Why I Built This

API Providers

Setup

Privacy

Roadmap

For Developers

Credits

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 14

Contributors

Uh oh!

Languages