A local, privacy-focused web application that uses advanced AI models to transcribe, differentiate speakers (diarization), and summarize audio/video content.
Before setting up the project, ensure you have the following installed on your Windows system:
- Python 3.10 or 3.11 (recommended)
- Download Python
- Check "Add Python to PATH" during installation.
- Microsoft Visual C++ Build Tools (Required for some AI libraries)
- Download Build Tools
- Install "Desktop development with C++" workload.
- FFmpeg (Required for audio processing)
- Download FFmpeg
- Extract and add the
binfolder to your System PATH. - Verify by opening CMD and typing:
ffmpeg -version
- MongoDB Community Server
- Download MongoDB
- Install as a Service.
-
Clone the Project
cd "path\to\AI-summerizztion" -
Create Virtual Environment
python -m venv venv .\venv\Scripts\activate -
Install Dependencies
pip install -r requirements.txt -
Enable GPU Support (Recommended) To make transcription 10x-50x faster (requires NVIDIA GPU):
# Uninstall CPU torch first pip uninstall torch torchvision torchaudio -y # Install CUDA 11.8 version pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
-
Setup Authentication (Important)
A. HuggingFace Token (For Diarization)
- Get a token from Hugging Face.
- Accept user agreements for
pyannote/speaker-diarization-3.1andpyannote/segmentation-3.0. - Create a
.envfile in the root directory:HF_TOKEN=hf_yourtokenhere...
B. YouTube Cookies (For Downloads) Fixes "Sign in to confirm you’re not a bot" errors.
- Install "Get cookies.txt LOCALLY" extension on Chrome/Firefox.
- Go to YouTube (logged in).
- Export cookies and save the file as
cookies.txtin this project folder (d:\VS Code\Python\AI-summerizztion\cookies.txt).
-
Activate Environment (if not active)
.\venv\Scripts\activate
-
Start the App
python app.py
-
Open in Browser
- Go to: http://127.0.0.1:5000
Note: The first run will download models (approx 2GB). Please be patient.
DownloadError: Sign in to confirm...:- Missing
cookies.txt. See Step 5B above.
- Missing
ModuleNotFoundError:- Ensure
venvis activated.
- Ensure
torchcodecWarning:- You can safely ignore "torchcodec is not installed correctly" warnings if the app starts.