# 🎭 Moodify - AI-Powered Emotion-Aware Voice Chatbot


Moodify is a Dragon Ball Z themed voice chatbot that detects emotions from your voice and responds with mood-appropriate conversations. Talk to an AI that understands how you're feeling and adapts its responses accordingly!

## ✨ Features

- 🎤 **Voice Emotion Detection** - Detects 7 emotions from audio using state-of-the-art ML models
  - Happy, Sad, Angry, Fear, Surprise, Disgust, Neutral
- 🤖 **AI-Powered Responses** - Generates contextual, emotion-aware replies using Groq's LLM
- 🎨 **Dragon Ball Z Theme** - Character transformations based on detected emotions
  - Super Saiyan for Angry 😤
  - Ultra Instinct for Happy 😊
  - Base Form for Neutral 😐
  - And more!
- 🔊 **Voice Activity Detection** - Automatically sends audio after 3 seconds of silence
- 💬 **Conversation Memory** - Maintains context across the conversation
- 😊 **Graceful Degradation** - Works with audio only; face detection is optional
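The voice-activity-detection behavior above can be sketched as a simple energy gate: buffer per-chunk loudness values and fire once the signal has stayed quiet for 3 seconds. This is an illustrative stand-in for the frontend's real VAD (written in Python for brevity); the chunk rate and RMS threshold are assumptions, not the app's actual values.

```python
# Illustrative energy-based VAD: report "send" once every chunk in the last
# 3 seconds falls below a loudness threshold. The real app does this in the
# browser; chunk sizes and thresholds here are assumptions.

def should_send(chunk_rms: list[float], chunks_per_sec: int = 10,
                silence_rms: float = 0.01, silence_secs: float = 3.0) -> bool:
    """chunk_rms: per-chunk RMS loudness values, oldest first."""
    needed = int(chunks_per_sec * silence_secs)
    if len(chunk_rms) < needed:
        return False  # not enough audio buffered yet to decide
    return all(rms < silence_rms for rms in chunk_rms[-needed:])

speech_then_silence = [0.2] * 20 + [0.001] * 30   # 2 s speech, then 3 s silence
print(should_send(speech_then_silence))  # True
```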

πŸ—οΈ Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚   Frontend  │─────▢│   Backend    │─────▢│   Groq API  β”‚
β”‚  (React UI) β”‚      β”‚  (FastAPI)   β”‚      β”‚    (LLM)    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                            β”‚
                            β–Ό
                     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                     β”‚ HuggingFace  β”‚
                     β”‚ Audio Model  β”‚
                     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                            β”‚
                            β–Ό
                     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                     β”‚  CNN Model   β”‚
                     β”‚  (Optional)  β”‚
                     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

πŸ“ Project Structure

moodify/
β”œβ”€β”€ backend/                    # FastAPI backend
β”‚   β”œβ”€β”€ app/
β”‚   β”‚   β”œβ”€β”€ api/               # API routes
β”‚   β”‚   β”œβ”€β”€ core/              # Core utilities
β”‚   β”‚   β”œβ”€β”€ models/            # ML models & schemas
β”‚   β”‚   β”œβ”€β”€ services/          # Business logic
β”‚   β”‚   β”œβ”€β”€ utils/             # Helper functions
β”‚   β”‚   └── main.py            # App entry point
β”‚   β”œβ”€β”€ trained_models/        # CNN model (optional)
β”‚   β”œβ”€β”€ requirements.txt
β”‚   └── README.md
β”‚
└── frontend/                  # React frontend
    β”œβ”€β”€ src/
    β”‚   β”œβ”€β”€ components/        # UI components
    β”‚   β”œβ”€β”€ services/          # API clients
    β”‚   └── App.jsx
    β”œβ”€β”€ public/
    └── package.json

## 🚀 Quick Start

### Prerequisites

- Python 3.10+
- Node.js 18+ (for the frontend)
- Groq API key (get one free)
- Optional: trained CNN model for face emotion detection

### Backend Setup

1. **Clone the repository**

   ```bash
   git clone https://github.com/ShubrotoDas10/moodify.git
   cd moodify/backend
   ```

2. **Create a virtual environment**

   ```bash
   python -m venv venv

   # Activate on Windows
   venv\Scripts\activate

   # Activate on macOS/Linux
   source venv/bin/activate
   ```

3. **Install dependencies**

   ```bash
   pip install -r requirements.txt
   ```

4. **Configure the environment**

   ```bash
   cp .env.example .env
   ```

   Edit `.env` and add your Groq API key:

   ```
   GROQ_API_KEY=your_groq_api_key_here
   ```

5. **Run the backend**

   ```bash
   python -m app.main
   ```

The backend will start on http://localhost:8000.

### Frontend Setup

1. **Navigate to the frontend**

   ```bash
   cd ../frontend
   ```

2. **Install dependencies**

   ```bash
   npm install
   ```

3. **Start the development server**

   ```bash
   npm run dev
   ```

The frontend will start on http://localhost:5173.

## 🎯 Usage

  1. Open the app in your browser (http://localhost:5173)
  2. Allow microphone access when prompted
  3. Press the mic button and start talking
  4. Stop talking and wait 3 seconds - your message will auto-send
  5. Watch Goku transform based on your emotion!
  6. See the AI response tailored to your mood

## 📑 API Endpoints

### Main Endpoints

| Endpoint | Method | Description |
|----------|--------|-------------|
| `/chat/audio` | POST | Send audio, get emotion + response |
| `/chat/text` | POST | Text-only chat (fallback) |
| `/audio/detect-emotion` | POST | Audio emotion detection only |
| `/health/models` | GET | Check backend status |

### Example API Call

```bash
curl -X POST "http://localhost:8000/chat/audio" \
  -F "audio=@recording.wav"
```

Response:

```json
{
  "emotion": {
    "emotion": "happy",
    "confidence": 0.87,
    "probabilities": {...}
  },
  "chat_response": {
    "message": "That's wonderful! Keep that positive energy!",
    "emotion_detected": "happy"
  }
}
```

See the full API documentation for details.
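A minimal sketch of consuming that response from a Python client. The field names follow the example response above; the defensive fallbacks (defaulting to `"neutral"` and an empty message) are assumptions, not documented backend behavior.

```python
# Sketch: pull the emotion label, confidence, and reply out of a /chat/audio
# response body, tolerating missing fields. Field names mirror the example
# response above; the fallback values are assumptions.

def parse_chat_response(payload: dict) -> tuple[str, float, str]:
    """Return (emotion, confidence, message) from a decoded JSON response."""
    emotion_block = payload.get("emotion", {})
    chat_block = payload.get("chat_response", {})
    emotion = emotion_block.get("emotion", "neutral")
    confidence = float(emotion_block.get("confidence", 0.0))
    message = chat_block.get("message", "")
    return emotion, confidence, message

example = {
    "emotion": {"emotion": "happy", "confidence": 0.87},
    "chat_response": {
        "message": "That's wonderful! Keep that positive energy!",
        "emotion_detected": "happy",
    },
}
print(parse_chat_response(example))  # ('happy', 0.87, "That's wonderful! ...")
```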

## 🛠️ Tech Stack

### Backend

- **FastAPI** - High-performance web framework
- **PyTorch** - ML model inference
- **HuggingFace Transformers** - Audio emotion detection
- **OpenCV** - Face detection (optional)
- **Groq API** - LLM for response generation
- **librosa** - Audio processing

### Frontend

- **React** - UI framework
- **Web Audio API** - Voice recording
- **Fetch API** - Backend communication
- **CSS3** - Dragon Ball Z themed styling

## 🎨 Emotion → Character Mapping

| Emotion | Character State |
|---------|-----------------|
| 😤 Angry | Super Saiyan Goku (Golden aura) |
| 😊 Happy | Ultra Instinct Goku (Silver glow) |
| 😢 Sad | Base Goku (Looking down) |
| 😰 Fear | Injured Goku (Worried) |
| 😲 Surprise | Shocked Goku (Wide eyes) |
| 🤢 Disgust | Annoyed Vegeta (Scowling) |
| 😐 Neutral | Base Goku (Relaxed) |
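The mapping above can be expressed as a simple lookup table with a neutral fallback, roughly how the frontend might pick a character sprite. The key names and the fallback-to-neutral behavior are illustrative assumptions.

```python
# Sketch of the emotion -> character mapping above as a lookup table.
# Keys mirror the seven detected emotions; the fallback is an assumption.

CHARACTER_MAP = {
    "angry": "Super Saiyan Goku (Golden aura)",
    "happy": "Ultra Instinct Goku (Silver glow)",
    "sad": "Base Goku (Looking down)",
    "fear": "Injured Goku (Worried)",
    "surprise": "Shocked Goku (Wide eyes)",
    "disgust": "Annoyed Vegeta (Scowling)",
    "neutral": "Base Goku (Relaxed)",
}

def character_for(emotion: str) -> str:
    """Fall back to the neutral base form for unknown labels."""
    return CHARACTER_MAP.get(emotion.lower(), CHARACTER_MAP["neutral"])
```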

## 🧪 Testing

### Test Backend Health

```bash
curl http://localhost:8000/health
```

### Test with Sample Audio

```bash
curl -X POST "http://localhost:8000/chat/audio" \
  -F "audio=@test_audio.wav"
```

### Run Unit Tests

```bash
cd backend
pytest
```

## 📦 Optional: CNN Face Emotion Detection

Moodify works well with audio alone, but you can add face emotion detection:

1. Train or obtain a CNN model for emotion recognition
2. Place the model file at `backend/trained_models/cnn_face_emotion.pth`
3. Restart the backend - it will automatically load the CNN

See CNN_OPTIONAL.md for details.
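The graceful-degradation pattern the steps above rely on can be sketched like this: try to load the optional model, and fall back to audio-only mode when the file is missing. The function name, the warning text, and the return convention are illustrative, not the project's actual code.

```python
# Sketch of graceful degradation: load the optional CNN if its weights file
# exists, otherwise return None to signal audio-only mode. The real backend
# would call torch.load(path) where the placeholder string is returned.
import os

def load_optional_cnn(path: str = "./trained_models/cnn_face_emotion.pth"):
    """Return a loaded model, or None to signal audio-only mode."""
    if not os.path.exists(path):
        print("⚠ CNN model not available - running audio-only")
        return None
    return "cnn-model"  # placeholder for the actual torch.load(path)

model = load_optional_cnn("/nonexistent/path.pth")
face_detection_enabled = model is not None  # False here: audio-only mode
```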

πŸ› Troubleshooting

Backend won't start

# Fix NumPy compatibility issue
pip install "numpy<2"
pip install opencv-python-headless==4.9.0.80

Frontend can't connect

  • Ensure backend is running on port 8000
  • Check CORS settings in .env
  • Try http://localhost:8000/health in browser

Microphone not working

  • Use HTTPS in production (required for mic access)
  • Check browser permissions
  • Ensure site is not blocked

CNN model not loading

  • This is OK! Backend works with audio-only
  • Check logs for "⚠ CNN model not available"
  • See CNN_OPTIONAL.md

## 📖 Documentation

## 🤝 Contributing

Contributions are welcome! Please feel free to submit a pull request.

1. Fork the repository
2. Create your feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

πŸ“ Environment Variables

Backend (.env)

GROQ_API_KEY=your_groq_api_key          # Required
HF_TOKEN=your_hf_token                  # Optional
CNN_MODEL_PATH=./trained_models/cnn_face_emotion.pth  # Optional
AUDIO_CONFIDENCE_THRESHOLD=0.70
MAX_AUDIO_SIZE_MB=10
ALLOWED_ORIGINS=http://localhost:3000,http://localhost:5173
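A sketch of how the backend might read these variables with defaults. The variable names match the `.env` example above; the default values and the helper itself are assumptions (the real backend may use pydantic settings or similar).

```python
# Sketch: read the .env variables above from the environment with defaults.
# Names match the .env example; defaults and structure are assumptions.
import os

def load_settings(env=os.environ) -> dict:
    return {
        "groq_api_key": env.get("GROQ_API_KEY", ""),  # required at runtime
        "cnn_model_path": env.get(
            "CNN_MODEL_PATH", "./trained_models/cnn_face_emotion.pth"),
        "audio_confidence_threshold": float(
            env.get("AUDIO_CONFIDENCE_THRESHOLD", "0.70")),
        "max_audio_size_mb": int(env.get("MAX_AUDIO_SIZE_MB", "10")),
        "allowed_origins": env.get(
            "ALLOWED_ORIGINS", "http://localhost:5173").split(","),
    }

settings = load_settings({
    "AUDIO_CONFIDENCE_THRESHOLD": "0.8",
    "ALLOWED_ORIGINS": "http://a.com,http://b.com",
})
```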

## 🎓 How It Works

1. **Voice Input** → User speaks into the microphone
2. **VAD** → Frontend detects 3 seconds of silence
3. **Audio Sent** → WAV file sent to the `/chat/audio` endpoint
4. **Emotion Detection** → HuggingFace model analyzes the audio
5. **Response Generation** → Groq LLM creates a mood-appropriate response
6. **Character Update** → Frontend displays the matching DBZ character
7. **Chat Display** → Message shown with an emotion badge
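The server-side half of this pipeline (steps 4-5) can be sketched with the model and LLM calls stubbed out. The low-confidence fallback to `"neutral"` mirrors the `AUDIO_CONFIDENCE_THRESHOLD` setting but is otherwise an assumption about how the backend handles shaky predictions.

```python
# Sketch of steps 4-5: detect the emotion, fall back to "neutral" when the
# model isn't confident, then generate a mood-appropriate reply. The two
# stage functions are stubs standing in for the HuggingFace and Groq calls.

def handle_audio(audio_bytes, detect_emotion, generate_reply, threshold=0.70):
    emotion, confidence = detect_emotion(audio_bytes)
    if confidence < threshold:
        emotion = "neutral"  # don't trust a low-confidence prediction
    reply = generate_reply(emotion)
    return {"emotion": emotion, "confidence": confidence, "reply": reply}

result = handle_audio(
    b"...wav bytes...",
    detect_emotion=lambda _: ("happy", 0.87),        # stub model
    generate_reply=lambda e: f"Responding in a {e} tone!",  # stub LLM
)
```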

## 🔐 Security Notes

- Never commit `.env` files
- Use environment variables for API keys
- Enable CORS only for trusted origins
- Validate all file uploads
- Use HTTPS in production

## 📊 Performance

- **Emotion Detection**: ~1-2 seconds
- **Response Generation**: ~2-3 seconds
- **Total Response Time**: ~3-5 seconds
- **Concurrency**: Supports multiple concurrent users

## 🚀 Deployment

### Backend

- **Docker**: `docker-compose up -d`
- **Railway/Render**: Connect your GitHub repo
- **AWS/GCP**: Use the provided Dockerfile

### Frontend

- **Vercel**: `npm run build` → Deploy
- **Netlify**: Connect your GitHub repo
- **Cloudflare Pages**: Auto-deploy on push

## 🌟 Future Enhancements

- Multi-language support
- Voice synthesis for AI responses
- Mobile app (React Native)
- Advanced emotion analytics
- Custom character themes
- Real-time conversation insights
- Group chat support

## 📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

## 👨‍💻 Author

**Shubroto Das**

## 🙏 Acknowledgments

## ⭐ Star History

## 📞 Support

If you have questions or need help, please open an issue on the repository.

Made with ❤️ by **Shubroto Das**

If you found this project helpful, please consider giving it a ⭐!

Report Bug · Request Feature · Documentation
