LLM Audio Transcriber (Streamlit + OpenAI)

A web application built with Streamlit to transcribe English audio files using OpenAI's gpt-4o-mini-transcribe model via its API. It handles various audio formats, automatically chunks large files for processing, and displays the resulting transcription.

Features

Audio Upload: Supports various common audio formats (MP3, WAV, M4A, OGG, FLAC, etc.).
Transcription: Uses OpenAI's gpt-4o-mini-transcribe model for accurate English transcription.
Format Conversion: Leverages pydub to handle different input formats.
Automatic Chunking: Splits audio longer than approximately 10 minutes into smaller chunks before sending to the API, allowing processing of large files.
Progress Indicator: Displays a progress bar during the transcription of multiple chunks.
Simple UI: Easy-to-use web interface built with Streamlit.
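
The automatic chunking described above can be pictured as computing fixed-length time windows over the audio. The following is a minimal sketch, not the app's actual code: the function name `chunk_ranges` and the exact 10-minute window are assumptions made for illustration. With pydub, each `(start, end)` pair in milliseconds could then be used to slice an `AudioSegment` as `audio[start:end]`.

```python
from typing import List, Tuple

# Assumed chunk length, matching the "approximately 10 minutes" mentioned above.
TEN_MINUTES_MS = 10 * 60 * 1000


def chunk_ranges(total_ms: int, chunk_ms: int = TEN_MINUTES_MS) -> List[Tuple[int, int]]:
    """Return (start, end) millisecond windows that cover [0, total_ms)."""
    if total_ms < 0 or chunk_ms <= 0:
        raise ValueError("total_ms must be >= 0 and chunk_ms must be > 0")
    # The last window is clipped so it never runs past the end of the audio.
    return [
        (start, min(start + chunk_ms, total_ms))
        for start in range(0, total_ms, chunk_ms)
    ]
```

A 25-minute file would yield three windows under these assumptions: two full 10-minute chunks and one 5-minute remainder.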

Technology Stack

Backend: Python
Frontend: Streamlit
Audio Processing: pydub
Transcription Service: OpenAI API
System Dependencies: ffmpeg (required by pydub)

Setup and Run Locally

Follow these steps to run the application on your local machine.

Prerequisites:

Python 3.8 or higher
pip (Python package installer)
Git
ffmpeg:
- Debian/Ubuntu: sudo apt update && sudo apt install ffmpeg
- macOS (Homebrew): brew install ffmpeg
- Windows: Download from ffmpeg.org, install, and add to system PATH.
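
To confirm that ffmpeg is reachable on your PATH before launching the app, a small check like the one below may help. It is only a sketch; the helper name `ffmpeg_available` is hypothetical and not part of this repository.

```python
import shutil


def ffmpeg_available(executable: str = "ffmpeg") -> bool:
    """Return True if the given executable can be found on the system PATH."""
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found on PATH")
    else:
        print("ffmpeg not found; install it and add it to PATH")
```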

Installation:

Clone the repository:

# Replace with your actual repository URL if different
git clone https://github.com/saad-git-007/streamlit-audio-transcriber.git
cd streamlit-audio-transcriber

(Alternatively, clone from the Hugging Face Space repository if that is your primary remote.)

Create and activate a virtual environment:

python3 -m venv venv
# On Linux/macOS
source venv/bin/activate
# On Windows
.\venv\Scripts\activate

Install Python dependencies:
```
pip install -r requirements.txt
```
Configure OpenAI API Key: You need to provide your OpenAI API key. Choose one method:
- Method A (Recommended for Local): Create a Streamlit secrets file.
  - Create the directory: mkdir .streamlit
  - Create the file: nano .streamlit/secrets.toml (or use another text editor)
  - Add your key inside: OPENAI_API_KEY="sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"
  - Save the file. (Ensure .streamlit/ is in your .gitignore file).
- Method B (Fallback for Local): If the secrets file is not found when running locally, the app shows a fallback input field in the sidebar where you can enter the key.
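
The two methods above amount to a simple precedence rule: use the key from the secrets file when present, otherwise fall back to whatever was typed into the sidebar. The sketch below illustrates that rule only; `resolve_api_key` is a hypothetical helper, and how `app.py` actually wires this up is not shown in this README. In a Streamlit app, the first argument would typically come from `st.secrets` and the second from a sidebar text input.

```python
from typing import Mapping, Optional


def resolve_api_key(secrets: Mapping[str, str], sidebar_value: str = "") -> Optional[str]:
    """Prefer the key from secrets; otherwise use the sidebar input, if any."""
    key = secrets.get("OPENAI_API_KEY", "").strip()
    if key:
        return key
    # Assumed fallback behaviour: an empty sidebar field means "no key yet".
    fallback = sidebar_value.strip()
    return fallback or None
```

Returning `None` when neither source provides a key lets the caller decide whether to show a warning or stop before calling the API.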
Run the Streamlit app:
```
streamlit run app.py
```
Open your web browser to the local URL provided (usually http://localhost:8501).

Deployment

This application was deployed using Hugging Face Spaces. The deployment requires the following files:

app.py: The main Streamlit application script.
requirements.txt: Lists Python package dependencies.
packages.txt: Lists system-level dependencies (ffmpeg) to be installed via apt-get on Hugging Face.

For deployment on platforms like Hugging Face Spaces, the OpenAI API key must be configured as a secret named OPENAI_API_KEY within the platform's secrets management settings.

Usage

Launch the application (either locally or via the deployment URL).
If running locally with Method B (the fallback), enter your OpenAI API key in the sidebar input box when prompted.
Use the "Choose an audio file..." button to upload an English audio file from your computer.
An audio player will appear to preview the file.
Click the "Transcribe Audio" button.
Wait for the progress bar (if the file is split into chunks) and processing indicators to complete.
The transcription text will appear in the text area below.
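
Behind the progress bar, the flow for a chunked file can be thought of as a loop that transcribes each chunk in order, reports the fraction completed, and joins the pieces into one text. The sketch below shows that pattern with the API call abstracted away; `transcribe_chunks`, `transcribe_one`, and `on_progress` are hypothetical names, and the real app may join or report progress differently.

```python
from typing import Callable, List, Sequence, TypeVar

Chunk = TypeVar("Chunk")


def transcribe_chunks(
    chunks: Sequence[Chunk],
    transcribe_one: Callable[[Chunk], str],
    on_progress: Callable[[float], None] = lambda fraction: None,
) -> str:
    """Transcribe chunks in order, reporting progress as a fraction in (0, 1]."""
    parts: List[str] = []
    total = len(chunks)
    for index, chunk in enumerate(chunks, start=1):
        # transcribe_one stands in for the call to the transcription service.
        parts.append(transcribe_one(chunk).strip())
        on_progress(index / total)
    # Skip empty results so silent chunks do not add stray spaces.
    return " ".join(part for part in parts if part)
```

Passing the transcription step in as a function keeps the loop testable without network access; a progress-bar update could be supplied as `on_progress`.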

License

This project is licensed under the Apache 2.0 License.
