🎙️ CVAudioStudio - Professional Text-to-Speech Converter

A professional, web-based text-to-speech converter powered by OpenAI's TTS API. Perfect for creating audio versions of CVs, interview preparation materials, study notes, and practice questions.

✨ Features

🎤 Multiple Voice Options - Choose from 6 different professional voices
🔊 3 Quality Models - Economy, Standard, and Premium TTS models
⚡ Speed Control - Adjust playback speed from 0.25x to 4.0x
📊 Cost Estimation - Real-time cost calculator for informed decisions
📜 History Tracking - View, play, and manage all generated audio files
💾 Multiple Formats - Support for MP3, Opus, AAC, FLAC, WAV, PCM
🎯 Character Counter - Live character/word count and duration estimation
🚀 Easy Deployment - One-click deployment to Streamlit Cloud
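
The character counter and duration estimate can be approximated with a few lines of Python. This is a minimal sketch rather than the app's actual code: `text_stats` is a hypothetical helper, and the speaking rate of 150 words per minute is an assumption picked for illustration.

```python
from dataclasses import dataclass

# Assumed average speaking rate at 1.0x speed; real TTS output varies by voice.
WORDS_PER_MINUTE = 150


@dataclass
class TextStats:
    characters: int
    words: int
    estimated_seconds: float


def text_stats(text: str, speed: float = 1.0) -> TextStats:
    """Count characters and words and estimate spoken duration (hypothetical helper)."""
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    words = len(text.split())
    # Duration shrinks proportionally as playback speed increases.
    seconds = words / WORDS_PER_MINUTE * 60 / speed
    return TextStats(characters=len(text), words=words, estimated_seconds=seconds)


print(text_stats("Experienced engineer with ten years in data platforms."))
```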

🎯 Use Cases

Professional Audio - Convert CVs and resumes to audio
Interview Prep - Practice with audio versions of common interview questions
Study Materials - Listen to notes, textbooks, and study guides
Language Learning - Hear correct pronunciation of foreign language texts
Accessibility - Create audio versions of written content
Content Creation - Generate voiceovers for presentations and videos

🚀 Quick Start

Prerequisites

Python 3.8 or higher
OpenAI API key (create one in the OpenAI platform dashboard)

Installation

Clone the repository

git clone https://github.com/yourusername/CVAudioStudio.git
cd CVAudioStudio

Install dependencies

pip install -r requirements.txt

Set up your API key

cp .env.example .env
# Edit .env and add your OpenAI API key

Run the application

streamlit run streamlit_app.py

Open your browser and navigate to http://localhost:8501

📖 Usage

Generate Audio

Enter your text - Paste your CV, study material, or any text (up to 5000 characters)
Choose a voice - Select from 6 professional voice options
Select model - Economy (cheapest), Standard, or Premium (best quality)
Adjust speed - Control playback speed (0.25x - 4.0x)
Generate - Click "Generate Audio" and wait a few seconds
Download - Play in browser or download the audio file
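
The steps above map onto a single speech request. The sketch below validates the inputs and builds the request parameters. `build_speech_request` and `synthesize` are hypothetical names, not functions from this repository, and `synthesize` assumes the official `openai` Python package is installed and `OPENAI_API_KEY` is set.

```python
VOICES = {"alloy", "echo", "fable", "onyx", "nova", "shimmer"}
MODELS = {"gpt-4o-mini-tts", "tts-1", "tts-1-hd"}
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MAX_TEXT_LENGTH = 5000  # limit stated in this README


def build_speech_request(text, voice="alloy", model="tts-1", speed=1.0,
                         response_format="mp3"):
    """Validate user input and return keyword arguments for a speech request."""
    text = text.strip()
    if not text:
        raise ValueError("text is empty")
    if len(text) > MAX_TEXT_LENGTH:
        raise ValueError(f"text exceeds {MAX_TEXT_LENGTH} characters")
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice}")
    if model not in MODELS:
        raise ValueError(f"unknown model: {model}")
    if response_format not in FORMATS:
        raise ValueError(f"unknown format: {response_format}")
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    return {
        "model": model,
        "voice": voice,
        "input": text,
        "speed": speed,
        "response_format": response_format,
    }


def synthesize(params, out_path):
    """Send the request and save the audio (assumes the openai package)."""
    from openai import OpenAI  # imported lazily so validation works without it

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.audio.speech.create(**params)
    response.write_to_file(out_path)
```

Keeping validation separate from the network call means bad input is rejected before any API credits are spent.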

View History

Click "📜 View History" in the sidebar
See all previously generated audio files
Play, download, or delete files
Search and sort by date or size
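
Searching and sorting the history can be sketched with the standard library alone. `list_history` is a hypothetical helper written for illustration and is not taken from pages/1_📜_History.py. It assumes generated files sit directly in one directory, such as audio/.

```python
from pathlib import Path


def list_history(audio_dir, query="", sort_by="date", descending=True):
    """Return files in audio_dir, filtered by name and sorted by date or size."""
    files = [p for p in Path(audio_dir).iterdir() if p.is_file()]
    if query:
        # Case-insensitive substring match on the file name.
        files = [p for p in files if query.lower() in p.name.lower()]
    if sort_by == "date":
        key = lambda p: p.stat().st_mtime  # last-modified time
    elif sort_by == "size":
        key = lambda p: p.stat().st_size   # size in bytes
    else:
        raise ValueError("sort_by must be 'date' or 'size'")
    return sorted(files, key=key, reverse=descending)
```

Calling `list_history("audio", query="cv", sort_by="size")` would return the matching files, largest first.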

💰 Pricing

CVAudioStudio uses OpenAI's TTS API with transparent pricing:

Model	Price	Quality	Best For
gpt-4o-mini-tts	$5.00/1M chars	High	Most use cases
tts-1	$15.00/1M chars	Standard	Professional audio
tts-1-hd	$30.00/1M chars	Premium	Best quality

Example costs:

Typical CV (500 chars): ~$0.0025 (Economy model), ~$0.01 (Standard model)
Study notes (2000 chars): ~$0.01 (Economy model), ~$0.03 (Standard model)
Interview prep (1000 chars): ~$0.005 (Economy model), ~$0.02 (Standard model)
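
The rates in the table translate into a per-request estimate by multiplying the character count by the per-million price. The sketch below shows that arithmetic. `estimate_cost` is a hypothetical helper, not the app's own calculator.

```python
# Prices in USD per 1M characters, copied from the pricing table.
PRICE_PER_MILLION_CHARS = {
    "gpt-4o-mini-tts": 5.00,
    "tts-1": 15.00,
    "tts-1-hd": 30.00,
}


def estimate_cost(num_chars: int, model: str) -> float:
    """Return the estimated cost in USD for num_chars characters."""
    if num_chars < 0:
        raise ValueError("num_chars must be non-negative")
    return num_chars * PRICE_PER_MILLION_CHARS[model] / 1_000_000


# Compare all three models for a 1000-character text.
for model in PRICE_PER_MILLION_CHARS:
    print(f"{model}: ${estimate_cost(1000, model):.4f}")
```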

🎨 Available Voices

Voice	Gender	Accent	Description
alloy	Neutral	American	Clear and articulate
echo	Male	American	Deep and authoritative
fable	Male	British	Warm and engaging
onyx	Male	American	Confident and professional
nova	Female	American	Friendly and clear
shimmer	Female	American	Expressive and warm
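
Voice metadata like the table above fits naturally in a dictionary. The sketch below is an assumption about how such a structure might look. `VOICE_METADATA` and `find_voices` are hypothetical names and may not match what config/voices.py actually contains.

```python
# Values copied from the voice table; the structure itself is illustrative.
VOICE_METADATA = {
    "alloy": {"gender": "Neutral", "accent": "American", "description": "Clear and articulate"},
    "echo": {"gender": "Male", "accent": "American", "description": "Deep and authoritative"},
    "fable": {"gender": "Male", "accent": "British", "description": "Warm and engaging"},
    "onyx": {"gender": "Male", "accent": "American", "description": "Confident and professional"},
    "nova": {"gender": "Female", "accent": "American", "description": "Friendly and clear"},
    "shimmer": {"gender": "Female", "accent": "American", "description": "Expressive and warm"},
}


def find_voices(gender=None, accent=None):
    """Return voice names matching the given gender and/or accent."""
    return [
        name
        for name, meta in VOICE_METADATA.items()
        if (gender is None or meta["gender"] == gender)
        and (accent is None or meta["accent"] == accent)
    ]


print(find_voices(gender="Male", accent="American"))
```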

📁 Project Structure

CVAudioStudio/
├── streamlit_app.py          # Main application
├── pages/
│   └── 1_📜_History.py      # History page
├── utils/
│   └── audio_generator.py    # TTS generation logic
├── config/
│   ├── openai_voices.py     # Voice configurations
│   └── voices.py            # Voice metadata
├── audio/                    # Generated audio files
├── text/                     # Sample text files
├── logs/                     # Application logs
├── requirements.txt          # Python dependencies
├── .env.example             # Environment variables template
├── README.md                # This file
├── QUICKSTART.md            # 5-minute setup guide
├── DEPLOYMENT.md            # Deployment instructions
└── DEPLOYMENT_COMPARISON.md # FastAPI vs Streamlit comparison

🚀 Deployment

Streamlit Cloud (Recommended - Free & Easy)

Push code to GitHub

git add .
git commit -m "Initial commit"
git push origin main

Deploy to Streamlit Cloud
- Go to share.streamlit.io
- Click "New app"
- Connect your GitHub repository
- Select streamlit_app.py
- Add your OpenAI API key in "Secrets"
- Deploy!

Your app will be live at a URL such as https://cvaudiostudio.streamlit.app (the subdomain depends on the app name you choose).

For detailed deployment instructions, see DEPLOYMENT.md.

🛠️ Development

Local Development

# Install development dependencies
pip install -r requirements.txt

# Run with auto-reload
streamlit run streamlit_app.py --server.runOnSave true

# View logs
tail -f logs/app.log

Adding New Voices

Edit config/openai_voices.py to add custom voice configurations.

Customizing Models

Modify utils/audio_generator.py to add new model options.

📚 Documentation

Quick Start Guide (QUICKSTART.md) - Get running in 5 minutes
Deployment Guide (DEPLOYMENT.md) - Complete deployment instructions
Deployment Comparison (DEPLOYMENT_COMPARISON.md) - FastAPI vs Streamlit

🔧 Configuration

Environment Variables

Create a .env file:

# Required
OPENAI_API_KEY=your_api_key_here

# Optional
OPENAI_ORG_ID=your_organization_id
LOG_LEVEL=INFO
MAX_TEXT_LENGTH=5000
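
These variables can be read into a typed settings object at startup. The sketch below is one possible approach, not the app's actual loader. `Settings` and `load_settings` are hypothetical names, and the code assumes the values are already in the process environment (for example after calling `load_dotenv()` from the python-dotenv package).

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    openai_api_key: str
    openai_org_id: Optional[str] = None
    log_level: str = "INFO"
    max_text_length: int = 5000


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from environment variables, applying the documented defaults."""
    api_key = env.get("OPENAI_API_KEY", "").strip()
    if not api_key:
        # The one required variable; fail early with a clear message.
        raise RuntimeError("OpenAI API key not found: set OPENAI_API_KEY in .env")
    return Settings(
        openai_api_key=api_key,
        openai_org_id=env.get("OPENAI_ORG_ID") or None,
        log_level=env.get("LOG_LEVEL", "INFO").upper(),
        max_text_length=int(env.get("MAX_TEXT_LENGTH", "5000")),
    )
```

Passing the environment as a parameter keeps the function easy to test with a plain dictionary.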

Streamlit Configuration

Edit .streamlit/config.toml to customize:

[theme]
primaryColor = "#1f77b4"
backgroundColor = "#ffffff"

[client]
showErrorDetails = false

[logger]
level = "info"

🐛 Troubleshooting

Common Issues

"OpenAI API key not found"

Ensure the .env file exists and contains your API key
Restart the application after adding the key

"Error generating audio"

Check that your OpenAI API key is valid
Verify you have sufficient API credits
Check the logs in logs/app.log

Audio not playing

Try a different browser
Check browser console for errors
Ensure audio format is supported

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for the TTS API
Streamlit for the amazing framework
The open-source community

📧 Support

For issues, questions, or suggestions:

Open an issue on GitHub
Email: your.email@example.com
Twitter: @yourusername

⭐ Show Your Support

If you find this project useful, please consider:

⭐ Starring it on GitHub
🐦 Sharing it on Twitter
💬 Telling your friends and colleagues

Built with ❤️ using Streamlit and OpenAI TTS API
