A powerful application that generates transcripts from both YouTube videos and local video files. The application uses OpenAI's Whisper model for local video transcription and YouTube's API for online videos.
-
YouTube Video Processing
- Extract transcripts from YouTube videos
- Generate text summaries
- Export in Markdown format
- Support for multiple languages
-
Local Video Processing
- Convert MP4 videos to transcripts
- Automatic MP4 to MP3 conversion
- Timestamped transcripts in Markdown format
- Organized output in timestamped directories
- Python 3.8 or higher
- FFmpeg installed and added to system PATH
- Windows/Linux/MacOS
- Clone the repository:
git clone https://github.com/yourusername/transcript_generator.git
cd transcript_generator- Create and activate a virtual environment:
# Windows
python -m venv venv
.\venv\Scripts\activate
# Linux/MacOS
python3 -m venv venv
source venv/bin/activate- Install dependencies:
pip install -r requirements.txt- Install FFmpeg:
- Windows:
- Download from https://www.gyan.dev/ffmpeg/builds/
- Extract and rename folder to 'ffmpeg'
- Move to C:\ drive
- Add C:\ffmpeg\bin to system PATH
- Linux:
sudo apt-get install ffmpeg - MacOS:
brew install ffmpeg
- Windows:
- Start the application:
python transcript_generator_ui.py- The application has two main features:
- Select the "YouTube Video" tab
- Paste the YouTube video URL
- Choose either:
- "Generate Transcript & Summary" for full processing
- "Export Transcript Only" for just the transcript
- Wait for processing to complete
- Find the output files in the generated timestamp directory
- Select the "Local Video" tab
- Click "Browse Video File" to select an MP4 file
- Click "Generate Transcript"
- Wait for processing to complete
- Find the transcript in the generated timestamp directory
video_id_transcript.md: Full transcriptvideo_id_summary.md: Text summary- Files are organized in a directory with the video ID
filename_transcript.md: Transcript with timestamps- Organized in directories named
filename_YYYYMMDD_HHMMSS - Timestamps for each segment of speech
-
FFmpeg Error:
- Ensure FFmpeg is properly installed
- Verify system PATH includes FFmpeg
- Restart application after installation
-
Memory Issues:
- Close other applications
- For large videos, ensure sufficient free memory
- Consider using a machine with more RAM
-
File Access Errors:
- Run the application with appropriate permissions
- Ensure write access to the output directory
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI's Whisper model for transcription
- YouTube API for video processing
- MoviePy for video conversion
- All other open-source contributors