A web app that uses AI to generate transcripts for audio files, using WhisperX and OpenAI's GPT-3.
| Home page | Transcription result |
|---|---|
![]() |
![]() |
| Transcription summary | ChatGPT conversation |
![]() |
![]() |
- Python 3.11
- ffmpeg
- CUDA (optional)
Create a virtual environment and activate it:
python3 -m venv .venv
source .venv/bin/activateInstall the libraries:
pip install git+https://github.com/m-bain/whisperx.git torch flask openaiCopy the .env.example file to .env and fill in the values:
OPENAI_API_KEY: Your OpenAI API keyHUGGINGFACE_TOKEN: Your HuggingFace token
python3 main.py


