🎤 Transcriber App - Solution for WhatsApps's unavailable regional Language Transcription

A powerful, intelligent audio transcription application built with love for accessibility 💙

Features • Installation • Usage • Deployment • License

💝 A Personal Story

This project holds a special place in my heart. It was created for my father, who is hearing impaired, to help him convert audio messages, voice notes, and recordings into readable text. Watching him struggle to understand audio content inspired me to build something that could make his daily life easier.

This is not just another project in my portfolio—it's one of my most cherished creations throughout my entire development journey. Every line of code was written with purpose, every feature designed with empathy, and every bug fixed with determination.

This app represents more than technology; it represents the power of using our skills to make a meaningful difference in the lives of those we love.

🌟 Features

⚡ Core Functionality

🎙️ Real-time Audio Recording - Record directly from your device
📁 File Upload Support - Import existing audio files from whatsapp (.opus)
🤖 AI-Powered Transcription - Powered by Google Gemini 2.5 with triple-fallback system
💾 History Management - Save, view, and manage all your transcriptions
🗑️ Smart Garbage Detection - Automatically filters out accidental or empty recordings
🔒 Secure API - Protected endpoints with secret-based authentication

🛡️ Advanced Features

Triple Safety Net Architecture:
- Primary: Gemini 2.5 Pro (High Intelligence)
- Secondary: Gemini 2.5 Flash (High Speed)
- Tertiary: Gemini 2.5 Flash Lite (Lightweight Backup)
Intelligent Error Handling - Network timeouts, connectivity checks, and graceful failures
Cross-Platform - Built with Flutter for Android (iOS support possible)
Offline Storage - Local database for transcription history
Material Design 3 - Modern, beautiful UI with accessibility in mind

📸 Demo

🏗️ Architecture

┌─────────────────────────────────────────┐
│         Flutter Mobile App              │
│  (Audio Recording + UI + History)       │
└─────────────┬───────────────────────────┘
              │ HTTPS + API Secret
              ▼
┌─────────────────────────────────────────┐
│       FastAPI Backend (Python)          │
│  (File Processing + API Gateway)        │
└─────────────┬───────────────────────────┘
              │
              ▼
┌─────────────────────────────────────────┐
│     Google Gemini 2.5 API               │
│  (Audio → Text Transcription)           │
└─────────────────────────────────────────┘

🚀 Installation

Prerequisites

Flutter SDK: 3.0 or higher
Python: 3.11 or higher
Google Gemini API Key: Get it from Google AI Studio
Git: For cloning the repository

1️⃣ Clone the Repository

git clone https://github.com/MabelMoncy/TranscriberAppWithServer.git
cd TranscriberAppWithServer

🖥️ Backend Setup

Step 1: Navigate to Backend Directory

cd backend

Step 2: Create Virtual Environment

# Windows
python -m venv myvenv
myvenv\Scripts\activate

# macOS/Linux
python3 -m venv myvenv
source myvenv/bin/activate

Step 3: Install Dependencies

pip install -r requirements.txt

Step 4: Configure Environment Variables

Create a .env file in the backend directory:

GEMINI_API_KEY=your_gemini_api_key_here
APP_SECRET=your_secure_secret_here

Generate a secure secret:

python -c "import secrets; print(secrets.token_urlsafe(32))"

Step 5: Run the Backend Server

uvicorn main:app --reload --host 0.0.0.0 --port 8000

Backend will be available at: http://localhost:8000

📱 Flutter App Setup

Step 1: Navigate to App Directory

cd transcriberapp

Step 2: Install Flutter Dependencies

flutter pub get

Step 3: Configure Environment Variables

Create a .env file in the transcriberapp directory:

SERVER_URL=http://YOUR_LOCAL_IP:8000
API_SECRET=your_secure_secret_here

Important: Replace YOUR_LOCAL_IP with your computer's local IP address (e.g., 192.168.1.100)

Step 4: Run the App

# Check connected devices
flutter devices

# Run on connected device
flutter run

# Or build APK
flutter build apk --release

🌐 Deployment

Backend Deployment (Render)

Note: Since free hosting have limits try other backend hosting platforms like koyeb or try by creating a new email id for render or for other hosting platform you are familiar with.

Create a Render Account: render.com
Create a New Web Service
Connect Your GitHub Repository
Configure Build Settings:
- Build Command: pip install -r requirements.txt
- Start Command: uvicorn main:app --host 0.0.0.0 --port $PORT
Set Environment Variables:
- GEMINI_API_KEY: Your API key
- APP_SECRET: Your secure secret
Deploy!

📖 Detailed Guide: See backend/RENDER_DEPLOYMENT.md

Flutter App Deployment (Play Store)

Generate Release Keystore
Configure Signing
Update Environment Variables with production backend URL
Build Release APK/AAB
Upload to Google Play Console

📖 Detailed Checklist: See transcriberapp/DEPLOYMENT_CHECKLIST.md

🎯 Usage

Recording Audio

Open the app
Tap the microphone button to start recording
Speak clearly
Tap the stop button when finished
Wait for transcription (usually takes time since server is hosted for free)
View your transcribed text!

Uploading Audio Files

Open WhatsApp and choose the voice message you want to transcribe
Long press and share to the app
Click the Start Transcription button
View the transcribed result and you can copy or share it.

Managing History

Tap the history button (clock icon)
View all past transcriptions
Tap any entry to view details. You can View and also hear by taping the play button
For deleting tap the delete button

🛠️ Tech Stack

Frontend (Mobile App)

Flutter - Cross-platform framework
Dart - Programming language
record - Audio recording package
http - API communication
sqflite - Local database
flutter_dotenv - Environment configuration

Backend (API Server)

FastAPI - Modern Python web framework
Uvicorn - ASGI server
Google Generative AI - Gemini API integration
Python-dotenv - Environment management
Pydantic - Data validation

AI & Cloud

Google Gemini 2.5 - Audio transcription
Render - Backend hosting (recommended)
Firebase - (Optional) for analytics

🔐 Security Features

✅ Environment-based configuration (no hardcoded secrets)
✅ API authentication via secret headers
✅ Request timeout protection (3 minutes)
✅ Input validation and sanitization
✅ Garbage audio detection to prevent wasted API calls
✅ HTTPS support for production
✅ Secure keystore for release builds

🐛 Troubleshooting

Backend Issues

Problem: GEMINI_API_KEY not found

Solution: Ensure .env file exists in backend/ directory with valid API key

Problem: 503 Service Unavailable

Solution: Check Gemini API quota and backend logs

Flutter Issues

Problem: Connection failed

Solution: Verify SERVER_URL in .env points to correct backend address

Problem: 401 Unauthorized

Solution: Ensure API_SECRET matches between backend and Flutter app

🤝 Contributing

While this is a personal project, I welcome contributions! If you'd like to help improve it:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📝 Roadmap

🙏 Acknowledgments

My Father - The inspiration behind this project
Google Gemini Team - For the powerful AI API
Flutter Community - For amazing packages and support
FastAPI Team - For the excellent framework
Everyone who believes in using technology for accessibility

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

👨‍💻 About the Developer

Mabel Moncy

This project represents countless hours of learning, debugging, and determination. It taught me that the best code we write isn't for grades or portfolios—it's for the people we love.

If this project helps you or inspires you, please ⭐ star it on GitHub!

💬 Contact & Support

GitHub: @MabelMoncy
Issues: Report a Bug
Discussions: Ask Questions

Made with ❤️ for accessibility and inclusion

"Technology should empower everyone, regardless of ability."

If this project made a difference in your life or someone you know, I'd love to hear about it!

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
backend		backend
transcriberapp		transcriberapp
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

🎤 Transcriber App - Solution for WhatsApps's unavailable regional Language Transcription

💝 A Personal Story

🌟 Features

⚡ Core Functionality

🛡️ Advanced Features

📸 Demo

🏗️ Architecture

🚀 Installation

Prerequisites

1️⃣ Clone the Repository

🖥️ Backend Setup

Step 1: Navigate to Backend Directory

Step 2: Create Virtual Environment

Step 3: Install Dependencies

Step 4: Configure Environment Variables

Step 5: Run the Backend Server

📱 Flutter App Setup

Step 1: Navigate to App Directory

Step 2: Install Flutter Dependencies

Step 3: Configure Environment Variables

Step 4: Run the App

🌐 Deployment

Backend Deployment (Render)

Flutter App Deployment (Play Store)

🎯 Usage

Recording Audio

Uploading Audio Files

Managing History

🛠️ Tech Stack

Frontend (Mobile App)

Backend (API Server)

AI & Cloud

🔐 Security Features

🐛 Troubleshooting

Backend Issues

Flutter Issues

🤝 Contributing

📝 Roadmap

🙏 Acknowledgments

📄 License

👨‍💻 About the Developer

💬 Contact & Support

Made with ❤️ for accessibility and inclusion

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages