This uses OpenAI's Whisper API - the same technology powering ChatGPT voice mode.
- ✅ 95%+ accuracy (same as ChatGPT voice)
- ✅ 99 languages including Bulgarian (bg)
- ✅ Auto language detection
- ✅ Handles accents, background noise, multiple speakers
- ✅ Real-time continuous transcription
pip install openai pyaudioOr double-click: install_whisper.bat
- Go to: https://platform.openai.com/api-keys
- Sign up / Log in
- Click "Create new secret key"
- Copy the key (starts with
sk-...)
- $0.006 per minute (~$0.36 per hour)
- Much cheaper than human transcription
- Pay only for what you use
- Run:
python audio_transcriber_whisper.py - Paste your API key
- Click "Set Key"
- Select microphone
- Choose language:
auto- Auto-detect (recommended)en- Englishbg- Bulgarianes- Spanish- etc.
- Click "Start Listening"
-
Chunk Duration:
- 3-5 seconds = faster response, good for conversations
- 7-10 seconds = better accuracy for long sentences
-
Language Setting:
- Use
autofor mixed languages - Set specific language (e.g.,
bg) for better accuracy in that language
- Use
-
Microphone:
- Use a good quality mic (HyperX, Trust USB)
- Speak clearly at normal pace
- Reduce background noise
English, Bulgarian, Spanish, French, German, Italian, Portuguese, Russian, Japanese, Chinese, Arabic, Turkish, Polish, Ukrainian, and 90+ more!
| Feature | Vosk (Free) | Google API | Whisper AI |
|---|---|---|---|
| Accuracy | 70% | 85% | 95%+ |
| Bulgarian | ❌ | ✅ | ✅ |
| Offline | ✅ | ❌ | ❌ |
| Cost | Free | Free (limited) | $0.006/min |
| Quality | Basic | Good | ChatGPT-level |
"Invalid API key"
- Make sure you copied the full key (starts with
sk-) - Check you have credits in your OpenAI account
"Insufficient quota"
- Add payment method at: https://platform.openai.com/account/billing
- New accounts get $5 free credit
Slow transcription
- Lower chunk duration to 3-4 seconds
- Check your internet connection
Poor accuracy
- Set specific language instead of
auto - Speak closer to microphone
- Reduce background noise