AI-Powered Voice-to-Workflow Automation Platform
Transform your voice commands into automated workflows with a visual graph interface. VoiceGraph uses AI to parse natural language, generate executable workflows, and orchestrate automation tasks across multiple services.
- Voice-to-Workflow: Speak your automation needs, get visual workflows
- Visual Workflow Builder: Drag-and-drop interface with React Flow
- Real-time Execution: Live progress tracking with node highlighting
- Background Processing: Async execution with complete history tracking
- Multi-Service Integration: Notion, Tavily, Email, GitHub, and more
- Cerebras LLM: Ultra-fast workflow parsing and content processing
- Groq Whisper: High-quality speech-to-text transcription
- Smart Parsing: Natural language understanding for complex workflows
- Parallel Execution: Optimized workflow processing
- Real-time Streaming: Live execution logs and progress updates
- Workflow History: Complete audit trail of all executions
- Voice Editing: Modify workflows using voice commands
┌─────────────────┐    ┌─────────────────┐
│    Frontend     │    │   AI Services   │
│    (Next.js)    │◄──►│   (Cerebras)    │
│                 │    │     (Groq)      │
└─────────────────┘    └─────────────────┘
         │                      │
         │                      │
         ▼                      ▼
┌─────────────────┐    ┌─────────────────┐
│    Workflow     │    │   Notion API    │
│    Execution    │    │   Integration   │
│     Engine      │    │   Tavily API    │
└─────────────────┘    └─────────────────┘
- Voice Input → Groq Whisper → Text
- Text → Cerebras LLM → Workflow JSON
- Workflow → React Flow → Visual Graph
- Execution → Direct API Calls → Service APIs
- Results → Real-time UI Updates
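The "Workflow JSON" step in the flow above is not specified in this README, so the following is only a sketch under assumptions: the `WorkflowNode` and `Workflow` shapes, the `dependsOn` field, and the node type names are all hypothetical. It shows one way a dependency graph could be grouped into levels whose nodes are safe to run in parallel.

```typescript
// Hypothetical workflow shape; the real Workflow JSON schema is not documented here.
interface WorkflowNode {
  id: string;
  type: string;        // illustrative names only, e.g. "notion_fetch"
  dependsOn: string[]; // ids of nodes that must finish first
}

interface Workflow {
  nodes: WorkflowNode[];
}

// Group node ids into levels. Every node in a level depends only on nodes
// from earlier levels, so the nodes inside one level could run in parallel.
function executionLevels(workflow: Workflow): string[][] {
  const done: Record<string, boolean> = {};
  let remaining = workflow.nodes.slice();
  const levels: string[][] = [];

  while (remaining.length > 0) {
    const ready = remaining.filter((n) =>
      n.dependsOn.every((d) => done[d] === true)
    );
    if (ready.length === 0) {
      throw new Error("Workflow has a cycle or a missing dependency");
    }
    ready.forEach((n) => {
      done[n.id] = true;
    });
    remaining = remaining.filter((n) => done[n.id] !== true);
    levels.push(ready.map((n) => n.id));
  }
  return levels;
}

// A three-step chain: fetch notes, summarize them, email the summary.
const notesToEmail: Workflow = {
  nodes: [
    { id: "fetch_notes", type: "notion_fetch", dependsOn: [] },
    { id: "summarize", type: "llm_summarize", dependsOn: ["fetch_notes"] },
    { id: "send_email", type: "email_send", dependsOn: ["summarize"] },
  ],
};

console.log(executionLevels(notesToEmail));
```

A purely sequential workflow yields one node per level; independent nodes (for example, two web searches feeding one report) would share a level.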
- Node.js 18+
- API Keys (see Environment Variables)
git clone <repository-url>
cd voicegraph
Create a .env file in the root directory:
# Cerebras AI (Required for prize eligibility)
CEREBRAS_API_KEY=your_cerebras_api_key_here
# Groq (Speech-to-Text)
GROQ_API_KEY=your_groq_api_key_here
# Notion Integration
NOTION_API_KEY=your_notion_integration_key_here
# Tavily (Web Search)
TAVILY_API_KEY=your_tavily_api_key_here
# SMTP (Email)
SMTP_HOST=smtp.gmail.com
SMTP_PORT=587
SMTP_USER=your_email@gmail.com
SMTP_PASSWORD=your_app_password
# Development
NODE_ENV=development
# Start Frontend
cd frontend
npm run dev
- Frontend: http://localhost:3000
- Voice: Click mic button and speak your automation needs
- Text: Type your requirements in the text input
- Templates: Use pre-built workflow templates
- Enter required parameters (Notion page IDs, email addresses, etc.)
- Review the generated workflow graph
- Modify nodes if needed
- Real-time: Click "Run Workflow" for live execution
- Background: Toggle "Background Execution" for async processing
- Monitor: Watch real-time progress and logs
- Execution Logs: Real-time progress updates
- Workflow History: Complete audit trail
- Node Status: Visual indicators for success/failure
- POST /api/transcribe: Speech-to-text conversion
- POST /api/parse: Natural language to workflow parsing
- POST /api/execute: Real-time workflow execution
- POST /api/execute-background: Background workflow execution
- POST /api/edit-workflow: Voice-based workflow editing
- GET /api/health: System health check
- GET /api/workflow-history: Execution history
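The request and response bodies of these endpoints are not documented here, so the sketch below rests on guesses: it assumes `/api/parse` accepts `{ text }` and returns the workflow JSON, and that `/api/execute` accepts `{ workflow }`. The `PostJson` type and the `parseAndExecute` helper are hypothetical names introduced for illustration. The transport is injectable so the call sequence can be tried without a running server.

```typescript
// Transport abstraction so the flow can be exercised without a running server.
type PostJson = (url: string, body: unknown) => Promise<unknown>;

// Default transport using the global fetch available in Node.js 18+ and browsers.
const postJson: PostJson = async (url, body) => {
  const res = await (globalThis as any).fetch(url, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  });
  if (!res.ok) {
    throw new Error("POST " + url + " failed with status " + res.status);
  }
  return res.json();
};

// ASSUMPTION: /api/parse takes { text } and returns the workflow JSON,
// and /api/execute takes { workflow }. Neither payload shape is confirmed.
async function parseAndExecute(
  text: string,
  baseUrl: string = "http://localhost:3000",
  post: PostJson = postJson
): Promise<unknown> {
  const workflow = await post(baseUrl + "/api/parse", { text });
  return post(baseUrl + "/api/execute", { workflow });
}
```

Against a locally running instance, a call might look like `parseAndExecute("Get my Notion meeting notes and email me a summary")`. Check the route handlers for the actual field names before relying on this.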
voicegraph/
├── frontend/ # Next.js application
│ ├── app/ # App router pages
│ ├── components/ # React components
│ ├── lib/ # Utilities and integrations
│ └── public/ # Static assets
└── README.md # This file
- Frontend: Next.js 14, React 18, Tailwind CSS, React Flow
- Backend: Node.js, Express.js, Server-Sent Events
- AI: Cerebras LLM, Groq Whisper
- Integrations: Notion API, Tavily API, SMTP
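Server-Sent Events are listed in the stack above as the streaming mechanism. As a rough illustration of the wire format only, the sketch below serializes one execution update into an SSE frame and parses it back. The `ExecutionEvent` fields and the `progress` event name are assumptions for illustration, not the project's actual payload.

```typescript
// Hypothetical payload for one execution update; field names are assumptions.
interface ExecutionEvent {
  nodeId: string;
  status: "running" | "success" | "error";
  message?: string;
}

// Serialize an update as a single SSE frame. A frame is a set of
// "field: value" lines terminated by a blank line. JSON.stringify never
// emits raw newlines, so the payload fits on one "data:" line.
function formatSseEvent(eventName: string, payload: ExecutionEvent): string {
  return "event: " + eventName + "\n" +
    "data: " + JSON.stringify(payload) + "\n\n";
}

// Recover the payload from a frame produced by formatSseEvent.
function parseSseData(frame: string): ExecutionEvent {
  const dataLine = frame
    .split("\n")
    .filter((line) => line.indexOf("data: ") === 0)[0];
  if (dataLine === undefined) {
    throw new Error("SSE frame has no data line");
  }
  return JSON.parse(dataLine.slice("data: ".length)) as ExecutionEvent;
}

const frame = formatSseEvent("progress", { nodeId: "summarize", status: "running" });
console.log(frame);
```

In a browser, the standard `EventSource` API performs this parsing automatically; the manual parser here only makes the frame layout explicit.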
# Start Frontend
cd frontend && npm run dev
# Test connections
curl http://localhost:3000/api/health
Voice: "Get my Notion meeting notes and email me a summary"
Workflow:
- Fetch Notion page
- Summarize content with AI
- Send email with summary
Voice: "Research hotels in Paris and create a Notion page with the results"
Workflow:
- Search web for hotels
- Process and format data
- Create Notion page with results
Voice: "Find trending topics, research them, and send a daily digest"
Workflow:
- Search for trending topics
- Research each topic
- Summarize findings
- Create formatted report
- Send email digest
- All API keys stored in environment variables
- No hardcoded credentials in codebase
- Verify API keys are set correctly
- Check console logs for errors
- Ensure microphone permissions are granted
- Check Groq API key is valid
- Verify network connectivity
- Verify workflow history API is accessible
- Check console logs for detailed errors
# Test API endpoints
curl http://localhost:3000/api/health
- Frontend: http://localhost:3000/api/health
- Frontend: Browser console + terminal
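The "verify API keys are set correctly" step in the troubleshooting notes can be automated with a small check. This is a sketch under assumptions: it treats every variable from the .env template as required (the README only marks Cerebras as required), and it flags values that still begin with `your_`, which is how the template's placeholders are written. `REQUIRED_KEYS` and `missingEnvKeys` are hypothetical names.

```typescript
// ASSUMPTION: all variables from the .env template are treated as required.
const REQUIRED_KEYS: string[] = [
  "CEREBRAS_API_KEY",
  "GROQ_API_KEY",
  "NOTION_API_KEY",
  "TAVILY_API_KEY",
  "SMTP_HOST",
  "SMTP_PORT",
  "SMTP_USER",
  "SMTP_PASSWORD",
];

// Return the keys that are unset, blank, or still hold a template
// placeholder such as "your_cerebras_api_key_here".
function missingEnvKeys(env: Record<string, string | undefined>): string[] {
  return REQUIRED_KEYS.filter((key) => {
    const value = env[key];
    return (
      value === undefined ||
      value.trim() === "" ||
      value.indexOf("your_") === 0
    );
  });
}

// In a Node.js script this would typically be called as
// missingEnvKeys(process.env) at startup.
console.log(missingEnvKeys({ SMTP_HOST: "smtp.gmail.com", SMTP_PORT: "587" }));
```

Failing fast on a non-empty result at startup tends to give clearer errors than a rejected API call in the middle of a workflow.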
This project was built for a hackathon competition with the following sponsor requirements:
- Primary LLM for workflow parsing
- Content processing and summarization
- Models: qwen-3-32b
- Using Llama models through Cerebras
- Covered by Cerebras integration
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
- Cerebras AI for ultra-fast LLM inference
- Groq for high-quality speech-to-text
- Notion and Tavily for API integrations
- React Flow for workflow visualization
VoiceGraph - Transform your voice into powerful automation workflows! 🎤✨