I'm Nandan K S a passionate Full-Stack Developer and MCA student at CMR Institute of Technology, Bengaluru.
I build scalable web applications and AI-powered solutions that solve real-world problems from real-time video conferencing platforms with on-the-fly meeting summaries to fully offline voice assistants that keep your data private.
I'm captivated by the intersection of real-time systems, AI / NLP, and clean software architecture. For me, writing great software isn't just about making things work it's about making them work beautifully.
"Let's build something impactful together."
- π MCA Student CMR Institute of Technology, Bengaluru
- π Currently building AI-integrated full-stack applications
- π± Exploring real-time systems, WebRTC, NLP & AI integration
- π‘ Passionate about privacy-first tech, developer tools, and elegant UX
- π€ Open to collaborations, new opportunities, and interesting ideas
An intelligent pipeline that transforms any YouTube video into AI-generated summaries, mind maps, and quizzes using Google Gemini + Flask.
| Feature | Detail |
|---|---|
| π AI Summary | Adjustable length (10β50%), cached with SQLite TTL |
| πΊοΈ Mind Map | Mermaid-format visual maps generated from transcript |
| π Quiz | Auto-generated Q&A from video content |
| π Chrome Extension | In-browser video analysis without leaving the tab |
| β‘ Real-time Updates | Socket.IO processing progress indicators |
Python Β· Flask Β· Google Gemini API Β· yt-dlp Β· Socket.IO Β· SQLite
A full-featured video conferencing platform with real-time AI transcription, smart meeting summaries, and ML-based presence detection.
| Feature | Detail |
|---|---|
| πΉ HD Video | WebRTC peer-to-peer video & audio with screen sharing, grid/speaker views |
| π€ AI Summaries | Auto-generated meeting summaries & action items via Google Gemini |
| π€ Face Detection | MediaPipe ML β auto-away detection + missed conversation catch-up |
| π¬ Live Chat | Real-time messaging with typing indicators and emoji support |
| ποΈ Persistence | MongoDB meeting & chat storage, full REST API + WebSocket events |
React 18 Β· Node.js Β· WebRTC Β· Socket.IO Β· MongoDB Β· Google Gemini AI Β· MediaPipe
Broadcast audio instantly over the web. One device streams; everyone listens through their own headphones β zero Bluetooth, zero extra hardware.
| Feature | Detail |
|---|---|
| π‘ Audio Capture | Browser Tab Audio (getDisplayMedia) or Microphone (getUserMedia) |
| π Instant Sharing | Unique 7-character room code + QR code generated on demand |
| ποΈ Host Controls | Pause, resume, stop broadcast, remove individual listeners |
| π Security | Helmet headers, CORS restriction, rate limiting, stale-room cleanup |
| π P2P Transport | WebRTC direct audio β no audio data ever touches the server |
React Β· Vite Β· TailwindCSS Β· Node.js Β· Socket.IO Β· WebRTC Β· Web Audio API
A 100% offline, privacy-first Windows voice assistant powered by OpenAI Whisper and a 4-layer NLP pipeline. Your voice, your control, your data.
| Feature | Detail |
|---|---|
| π€ Offline STT | OpenAI Whisper or Vosk β no internet, no cloud, no data collection |
| π§ 4-Layer NLP | Exact β Fuzzy β Phonetic β Semantic matching with weighted scoring |
| π‘οΈ Safety | 4-gate validation + command sandboxing (whitelist-only execution) |
| π₯οΈ System Control | Open/close any app, smart window & tab control, 40+ keyboard commands |
| β‘ Performance | Threaded execution, interruptible TTS, startup prewarming, accent support |
Python Β· OpenAI Whisper Β· PyAutoGUI Β· pyttsx3 Β· NLP Pipelines Β· Windows OS
β Thanks for stopping by! Feel free to browse my repos, star something useful, or reach out if you'd like to collaborate on something impactful.