Dictant.app - Minimalistic dictation app for Mac (OpenAI Whisper wrapper)

Dictant is a tiny (<1 MB) macOS menu bar push-to-talk app that turns your voice into polished text with OpenAI Whisper and optional ChatGPT cleanup.

Free and open-source — there are no paywalls, you only cover your own OpenAI API usage.

Download the latest version from the Releases page

Why Dictant

Built for speed: hold the right Command key or tap the menu bar icon to start/stop recording in a second.
Clipboard automation: copy and optionally paste the result straight into the active app.
Reliable status cues: menu bar icon blinks red while recording and green while processing.
Smart post-processing: optional ChatGPT pass with your own system prompt for better output.
Persistent history: recordings and transcripts live locally so you can retry failed jobs.
macOS-first: SwiftUI, native notifications, Keychain storage

Core Features

Push-to-talk: Hold the right ⌘ (Command key) for 1 second to start recording; release to stop
Clipboard & paste: Copy transcripts to the clipboard; auto-paste into the active text field (requires Accessibility).
ChatGPT post-processing: Run transcripts through GPT with your custom system prompt.
History & retries: Browse recordings, re-run failed/pending transcriptions, copy or open files in Finder, and clear history.
Smart Silence Removal: Automatically trims long pauses and silence from your audio before processing to improve transcription accuracy and reduce API usage.
Privacy-aware: API key stays in Keychain, audio files stay local (Application Support), only the transcription request hits OpenAI.

Quick Start

Launch the app; it lives in the menu bar.
Open Settings → Processing, paste your OpenAI API key, and save it (stored in Keychain).
Press and hold the right Command key for 1s to start talking, then release to stop. Or click the menu bar icon to toggle.
Optional tweaks in Settings → General:
- Run at system startup
- Enable push-to-talk
- Copy to clipboard
- Paste into the active input
Check Settings → History for transcripts; copy, re-run, or open recordings from there.

Requirements

macOS 14.0+ (built with Xcode 15+)
An OpenAI API key with Whisper access (create one at https://platform.openai.com/account/api-keys)
Internet access for transcription and optional ChatGPT post-processing

Install from Source

Clone the repository:

git clone https://github.com/sbrin/Dictant.git
cd Dictant

Open the project: Open Dictant.xcodeproj in Xcode.
Configure Building:
- Select the Dictant scheme from the target selector at the top.
- If prompted about signing, select your personal Development Team in the "Signing & Capabilities" tab of the project settings.
Build and Run: Press ⌘R or click the Play button to build and run the app.

Building and Packaging

If you want to build and package the app for distribution, use the scripts in the packaging/ directory.

Prerequisites for Packaging

No special tools are required for .pkg creation as it uses native pkgbuild.

Build Commands

Build PKG: ./packaging/build_and_pkg.sh

The artifacts will be placed in the build/ directory.

Build a PKG

Use the bundled helper scripts to create a standard macOS installer package. ** PKG Installer requires extra privacy and security permission via System Settings **

Build and package in one shot: ./packaging/build_and_pkg.sh

Versioning Notes

packaging/build_and_pkg.sh uses Xcode build settings (MARKETING_VERSION and CURRENT_PROJECT_VERSION) because the Info.plist is generated at build time. If you do not pass CURRENT_PROJECT_VERSION, the script increments the current build number automatically.

Usage Notes

Menu bar and pointer states: solid icon (idle), blinking red (recording), blinking green (processing).
Status menu (right-click): start/stop, cancel processing, open settings, open history, quit.
Auto-paste: Requires Accessibility permission; if missing, the app will prompt and temporarily disable paste until trusted.
ChatGPT prompt: Set your own system prompt to shape the post-processed text (defaults to a polishing prompt).

Permissions

Microphone: Required to record audio.
Accessibility: Needed for auto-paste and the global push-to-talk hotkey.
Notifications: Used for success/failure and permission guidance.

Troubleshooting

Invalid or missing API key: Add a valid key in Settings → Processing; the app surfaces notifications when the key is rejected.
Cannot start recording: Ensure microphone permission is granted in System Settings → Privacy & Security → Microphone.
Auto-paste disabled: Enable Accessibility access for Dictant in System Settings → Privacy & Security → Accessibility.
Short recordings discarded: Clips under a few seconds (or entirely silent/too quiet) are dropped automatically.
Retries: Use Settings → History to re-run failed or pending transcriptions.

Contributing

Open issues for bugs and feature ideas; PRs are welcome.
Please describe repro steps and expected behavior when filing bugs.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Dictant.xcodeproj		Dictant.xcodeproj
Dictant		Dictant
DictantTests		DictantTests
DictantUITests		DictantUITests
packaging		packaging
.cursorrules		.cursorrules
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
agents.md		agents.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dictant.app - Minimalistic dictation app for Mac (OpenAI Whisper wrapper)

Why Dictant

Core Features

Quick Start

Requirements

Install from Source

Building and Packaging

Prerequisites for Packaging

Build Commands

Build a PKG

Versioning Notes

Usage Notes

Permissions

Troubleshooting

Contributing

License

About

Uh oh!

Releases 3

Languages

License

sbrin/Dictant

Folders and files

Latest commit

History

Repository files navigation

Dictant.app - Minimalistic dictation app for Mac (OpenAI Whisper wrapper)

Why Dictant

Core Features

Quick Start

Requirements

Install from Source

Building and Packaging

Prerequisites for Packaging

Build Commands

Build a PKG

Versioning Notes

Usage Notes

Permissions

Troubleshooting

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Languages