This project provides a tool for transcribing audio files using the Whisper model API. The audio file is divided into manageable chunks, each chunk is transcribed, and the transcriptions are combined into a final document in Markdown format.

LuisMiguelMedina/AudioToText-WhisperAPI

Audio Transcription Project

Requirements

Make sure you have the following packages installed:

pip install whisper markdown2 pydub tqdm

Additionally, you need to have FFmpeg installed and available in your PATH.
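The FFmpeg requirement can also be checked from Python before any audio is loaded. The following is a minimal sketch using only the standard library; the helper names find_ffmpeg and ffmpeg_version_line are illustrative and are not part of this project.

```python
import shutil
import subprocess


def find_ffmpeg():
    """Return the full path of the ffmpeg executable, or None if it is not on PATH."""
    return shutil.which("ffmpeg")


def ffmpeg_version_line(ffmpeg_path):
    """Run `ffmpeg -version` and return the first line of its output."""
    result = subprocess.run(
        [ffmpeg_path, "-version"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.splitlines()[0]


if __name__ == "__main__":
    path = find_ffmpeg()
    if path is None:
        print("FFmpeg was not found on PATH; install it before running the script.")
    else:
        print(ffmpeg_version_line(path))
```

Running a check like this first gives a clear message up front, rather than a less obvious failure later when the audio file is decoded.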

Usage

Instructions

  1. Clone the repository:
git clone https://github.com/LuisMiguelMedina/AudioToText-WhisperAPI.git
cd AudioToText-WhisperAPI
  2. Install the dependencies:
pip install whisper markdown2 pydub tqdm
  3. Ensure FFmpeg is installed:

Check whether FFmpeg is on your PATH by running:

ffmpeg -version

If FFmpeg is not installed, follow the installation instructions on the FFmpeg website (https://ffmpeg.org).

  4. Place your audio file in the expected location:

By default, the script reads the audio file from ./Audios/audio.wav. If your file is somewhere else, update audio_file_path in the script.

  5. Run the script:
python transcribe.py
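For orientation, the chunk, transcribe, and combine flow described for this project can be sketched roughly as follows. This is not the contents of transcribe.py. It assumes the openai-whisper package (imported as whisper) and pydub; the function names, the 60-second chunk length, the "base" model, and the transcription.md output path are all hypothetical choices made for illustration.

```python
import os
import tempfile


def chunk_bounds(total_ms, chunk_ms):
    """Return (start, end) millisecond pairs that cover [0, total_ms) in order."""
    if chunk_ms <= 0:
        raise ValueError("chunk_ms must be positive")
    return [(start, min(start + chunk_ms, total_ms))
            for start in range(0, total_ms, chunk_ms)]


def to_markdown(title, texts):
    """Join per-chunk transcriptions into one Markdown document, skipping empty chunks."""
    paragraphs = [t.strip() for t in texts if t.strip()]
    return f"# {title}\n\n" + "\n\n".join(paragraphs) + "\n"


def transcribe_file(audio_path, transcribe_chunk, chunk_ms=60_000):
    """Split the audio with pydub and transcribe each chunk in order.

    transcribe_chunk is any callable that takes a pydub AudioSegment and returns text.
    """
    from pydub import AudioSegment  # lazy import; pydub relies on FFmpeg for decoding

    audio = AudioSegment.from_file(audio_path)
    bounds = chunk_bounds(len(audio), chunk_ms)  # len() of an AudioSegment is in ms
    texts = [transcribe_chunk(audio[start:end]) for start, end in bounds]
    return to_markdown("Transcription", texts)


def run_with_whisper(audio_path, output_path, model_name="base"):
    """Transcribe audio_path with a local Whisper model and write Markdown to output_path."""
    import whisper  # assumption: the openai-whisper package

    model = whisper.load_model(model_name)

    def whisper_chunk(segment):
        # Whisper reads from a file path, so export each chunk to a temporary WAV file.
        with tempfile.TemporaryDirectory() as tmp:
            chunk_path = os.path.join(tmp, "chunk.wav")
            segment.export(chunk_path, format="wav")
            return model.transcribe(chunk_path)["text"]

    markdown = transcribe_file(audio_path, whisper_chunk)
    with open(output_path, "w", encoding="utf-8") as f:
        f.write(markdown)
```

With the default layout, a call such as run_with_whisper("./Audios/audio.wav", "transcription.md") would produce the Markdown file. Passing the transcription step in as a callable keeps the chunking and Markdown assembly independent of the model, so they can be checked without loading Whisper.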
