A powerful Python-based voice assistant that transforms your Windows PC into a smart, voice-controlled system. Named "Vecna" for its comprehensive system control capabilities and AI-powered intelligence.
- Advanced Voice Recognition: Multi-layered recognition with Google Speech API and Whisper AI
- Natural Text-to-Speech: High-quality voice responses using pyttsx3
- No Wake Word: Continuous listening for seamless interaction
- AI Integration: OpenAI GPT and Google Gemini support for intelligent conversations
- Memory System: Persistent conversation memory and learning
- Application Management: Open, close, and control any Windows application
- Window Management: Switch, minimize, maximize, and arrange windows
- Process Control: Monitor and manage running processes
- File Operations: Create folders, find files, manage directories
- System Functions: Shutdown, restart, sleep, lock, volume control
- GUI Interface: Modern dark-theme interface with system tray integration
- Plugin System: Extensible architecture with custom plugin support
- Screen Capture: Screenshots and screen recording capabilities
- Timer System: Set multiple timers with custom messages
- Network Monitoring: WiFi status, network diagnostics
- System Monitoring: CPU, memory, disk, battery status
- Weather Plugin: Real-time weather information
- Calculator Plugin: Advanced mathematical calculations
- Todo Plugin: Task management and reminders
- Mouse Control: Precise coordinate-based clicking, dragging, scrolling
- Web Automation: Browser automation with Selenium and Beautiful Soup
- Enhanced App Launching: Smart detection for WhatsApp, DaVinci Resolve, Discord
- Window Focus: Advanced window management and switching
- File Operations: Automated file copying, moving, and organization
- Computer Vision: Screen analysis and image recognition capabilities
-
Run the startup script:
start_vecna.batThis will automatically set up the environment and install dependencies.
-
For Advanced Automation Features:
install_advanced_automation.batThis installs mouse control, web automation, and enhanced app launching.
-
Choose your preferred mode:
- Enhanced Vecna with GUI (recommended)
- Enhanced Vecna without GUI
- Simple Vecna (basic features)
-
Install Dependencies:
pip install -r requirements_complete.txt
-
Test Installation:
python test_vecna.py
-
Run Enhanced Vecna:
python vecna_enhanced.py
- "open notepad" / "open chrome" / "open calculator"
- "close window" / "close notepad"
- "switch to chrome"
- "minimize window" / "minimize current"
- "list windows"
- "screenshot" / "screenshot window chrome"
- "record screen for 30 seconds"
- "volume up" / "volume down" / "mute"
- "shutdown computer" / "restart computer"
- "lock computer" / "sleep computer"
- "list processes"
- "kill process notepad"
- "detailed system info"
- "network status"
- "wifi networks"
- "create folder Documents/NewProject"
- "find file document.txt"
- "type hello world"
- "search for python tutorials"
- "set timer 5 minutes coffee break"
- "list timers"
- "cancel timer coffee"
- "what's the weather in New York" (Weather plugin)
- "calculate 25 * 30 + 15" (Calculator plugin)
- "click at 500 300" - Click at specific coordinates
- "move mouse to 100 200" - Move mouse to position
- "drag from 100 100 to 200 200" - Drag operation
- "scroll up" / "scroll down" - Scroll actions
- "open whatsapp" - Launch WhatsApp with smart detection
- "open davinci" - Launch DaVinci Resolve
- "open discord" - Launch Discord
- "open telegram" - Launch Telegram
- "browse to google.com" - Open website in browser
- "search for python tutorials" - Automated web search
- "fill form field username with john" - Form automation
- "click button submit" - Web element interaction
- "window list" - List all open windows
- "focus notepad" - Focus specific window
- "minimize chrome" - Minimize specific application
- "create folder documents" - Create directories
- "copy file to backup" - File operations
- "delete temp files" - Cleanup operations
- "add task buy groceries" (Todo plugin)
- "list plugins"
- "enable plugin weather"
- "pause listening"
- "resume listening"
- "show gui"
- "hide gui"
- "what is artificial intelligence"
- "tell me a joke"
- "explain quantum computing"
- "write a short story"
Create a .env file for AI features:
OPENAI_API_KEY=your_openai_api_key_here
GEMINI_API_KEY=your_gemini_api_key_here
WEATHER_API_KEY=your_weather_api_key_here
Customize settings in config.json:
- Speech recognition sensitivity
- AI model preferences
- GUI themes and appearance
- Plugin settings
- Hotkey combinations
vecna-assistant/
├── vecna.py # Core voice assistant
├── vecna_enhanced.py # Enhanced version with all features
├── vecna_simple.py # Lightweight version
├── vecna_gui.py # GUI interface
├── vecna_plugin_system.py # Plugin architecture
├── advanced_system_control.py # Advanced Windows control
├── start_vecna.bat # Easy startup script
├── test_vecna.py # System test script
├── config.json # Configuration file
├── requirements_complete.txt # All dependencies
├── vecna_memory.json # Conversation memory
└── plugins/ # Plugin directory
├── weather_plugin.py
├── calculator_plugin.py
└── todo_plugin.py
- Full Feature Set: GUI, plugins, advanced system control
- Best For: Power users wanting maximum functionality
- Requirements: All dependencies from
requirements_complete.txt
- Lightweight: Core features with minimal dependencies
- Best For: Basic voice control and older systems
- Requirements: Only core dependencies
- Balanced: Full voice features without GUI overhead
- Best For: Users wanting comprehensive voice control
- Requirements: Standard dependencies
PyAudio Installation Problems:
pip install pipwin
pipwin install pyaudioWindows-specific Dependencies:
pip install pywin32
pip install comtypes-
Check Microphone Permissions
- Windows Settings → Privacy → Microphone
- Enable microphone access for applications
-
Adjust Recognition Settings
- Modify
energy_thresholdinconfig.json - Try different recognition engines (Google, Whisper)
- Modify
-
Network Issues
- Ensure stable internet connection
- Whisper works offline as fallback
-
Use Appropriate Version
- Simple version for basic functionality
- Enhanced version for full features
-
Adjust Settings
- Lower speech recognition timeout
- Disable unused plugins
- Reduce GUI update frequency
GUI Not Starting:
- Install tkinter:
pip install tk - Install Pillow:
pip install Pillow
Plugin Errors:
- Check plugin directory permissions
- Verify plugin dependencies
- Disable problematic plugins in config
- Local Processing: Core functionality works offline
- API Usage: AI features require internet (optional)
- Data Storage: Conversations stored locally in JSON format
- Permissions: Requires microphone and system control permissions
- OS: Windows 10/11 (64-bit recommended)
- Python: 3.8 or higher
- RAM: 4GB minimum, 8GB recommended
- Storage: 500MB for full installation
- Audio: Working microphone and speakers
- Network: Internet connection for AI features (optional)
pip install -r requirements_complete.txt --upgradecopy config.json config_backup.json
copy vecna_memory.json memory_backup.json- Add new plugins to the
plugins/directory - Enable/disable plugins via voice commands or GUI
- Create custom plugins using the provided template
- Fork the Repository
- Create Feature Branch:
git checkout -b feature/amazing-feature - Commit Changes:
git commit -m 'Add amazing feature' - Push to Branch:
git push origin feature/amazing-feature - Open Pull Request
- Use
vecna_plugin_system.pyas reference - Follow the VecnaPlugin base class structure
- Submit plugins for inclusion in the main project
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI for GPT API
- Google for Speech Recognition API
- OpenAI Whisper for offline speech recognition
- Python Community for excellent libraries
- Issues: Report bugs via GitHub Issues
- Questions: Start a GitHub Discussion
- Features: Submit feature requests with detailed descriptions
Made with ❤️ for voice-controlled computing
- Customizable: Add your own commands and configure settings
- Install Python 3.8+ if you don't have it already
- Install required packages:
pip install -r requirements.txt - Optional: Set your API keys in the Config class inside
vecna.pyif you want to use AI features - Run the assistant:
python vecna.py
- Start the assistant by running
python vecna.py - Say "Hey Vecna" to activate
- When Vecna responds with "Yes?", speak your command
- Example commands:
- "Open Chrome"
- "Open downloads folder"
- "What's the time?"
- "Set volume to 50 percent"
- "Search for weather in New York"
- "Take a screenshot"
- "Lock my computer"
- "Type hello world"
- "Copy this text"
Edit the Config class in vecna.py to:
- Change the wake words
- Configure voice settings
- Set API keys
- Add custom application paths
- Enable/disable features
- Windows 10/11
- Python 3.8+
- Working microphone
- Internet connection (for some features)
- Some commands may require admin privileges
- Speech recognition accuracy varies with microphone quality and background noise
- Wake word detection may sometimes trigger on similar sounds
- GUI interface with system tray icon
- More AI integrations
- Custom plugin system
- Better offline support