Telegram bot that transcribes voice messages and audio files using Whisper.
- Transcribe voice messages and audio files
- Auto-summarization for audio over 3 minutes
- Multi-language support (Spanish, English)
- Export transcriptions as text files
- Configure environment
cp .env.example .env- Edit
.envwith your credentials
TELEGRAM_BOT_TOKEN=your_token_here
OPENAI_API_KEY=your_key_here
ENVIRONMENT=development- Run the bot
docker compose --profile dev up -d- Check logs
docker compose --profile dev logs -fCreate a .env file with the following variables:
| Variable | Required | Description |
|---|---|---|
TELEGRAM_BOT_TOKEN |
Yes | Bot token from @BotFather |
OPENAI_API_KEY |
Yes | OpenAI API key for transcription and summarization |
ENVIRONMENT |
No | production (OpenAI API) or development (local Whisper). Default: development |
WHISPER_MODEL |
No | Local model size: tiny, base, small, medium, large-v3. Default: base |
WHISPER_DEVICE |
No | Device for local model: cpu, cuda, auto. Default: cpu |
/start- Welcome message/command- Show all available commands/setlang- Change language/history- View transcription history
See DEPLOY.md for VM deployment guide.
MIT