Navaa: Persian Podcast Generator

Navaa transforms your Persian voice notes into polished, full-length podcast episodes using OpenAI's suite of APIs in a simple, automated pipeline.

Technologies Used

Python 3.9+ for scripting and orchestration
OpenAI API
- Whisper for speech-to-text (STT)
- gpt-4o-mini for translation (Persian → English) and script generation
- gpt-4o-mini-tts for text-to-speech (TTS)
python-dotenv to manage environment variables securely
Logging via Python's logging module

Pipeline Flow

Speech-to-Text: Transcribe Persian audio notes into text with Whisper.
Translation: Translate the Persian transcript into English for clearer LLM understanding.
Script Generation: Use GPT to draft a conversational Persian podcast script (intro, body, conclusion).
Text-to-Speech: Synthesize the final Persian script into high-fidelity audio.

Getting Started

1. Clone the Repository

git clone https://github.com/YOUR_USERNAME/Navaa.git
cd Navaa

2. Install Dependencies

python -m venv venv
source venv/bin/activate      # macOS/Linux
# or venv\\Scripts\\activate   # Windows
pip install -r requirements.txt

3. Configure Your API Key

Create a file named .env in the project root.
Add your OpenAI key:
```
OPENAI_API_KEY=your_openai_api_key_here
```

4. Prepare Your Input

Place your Persian audio file(s) into the assets/inputs/ directory. For example:

assets/inputs/myvoice.wav

5. Run the Pipeline

Run the CLI tool with your filename (and optional output directory name):

python main.py --filename myvoice.wav [--outputname podcast_episode1]

--filename (-f): Name of the input file in assets/inputs/.
--outputname (-o): (Optional) Custom folder name under assets/outputs/. Defaults to the input file base name.

6. Inspect Outputs

After successful processing, check:

assets/outputs/{outputname}/
├── process.log         # Detailed processing log
├── stt.txt             # Whisper transcript
├── translation.txt     # English translation
├── script.txt          # Generated podcast script (Persian)
└── {outputname}_podcast.wav  # Final podcast audio

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
app		app
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Navaa: Persian Podcast Generator

Technologies Used

Pipeline Flow

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Configure Your API Key

4. Prepare Your Input

5. Run the Pipeline

6. Inspect Outputs

About

Uh oh!

Releases

Packages

Languages

License

rezaBarzgar/Navaa

Folders and files

Latest commit

History

Repository files navigation

Navaa: Persian Podcast Generator

Technologies Used

Pipeline Flow

Getting Started

1. Clone the Repository

2. Install Dependencies

3. Configure Your API Key

4. Prepare Your Input

5. Run the Pipeline

6. Inspect Outputs

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages