Releases: Eamon2009/AI-agent
AI Voice Agent v1.0.0 — Initial Stable Release
Overview
This is the initial pre-release of a lightweight Python-based voice assistant that combines speech recognition, system automation, and OpenAI-powered responses.
The assistant listens through the microphone, executes predefined system commands, and uses AI to handle general queries when no specific command is matched.
This version is functional but under active development.
Features
Voice Interaction
Speech recognition using the speech_recognition library (installed as the SpeechRecognition package)
Audio-to-text via Google Speech API
Text-to-speech using pyttsx3
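The listen-and-speak loop described above might look roughly like the sketch below. It is not taken from the repository: the function names (normalize, listen_once, speak) are hypothetical, and the choice to return an empty string on a recognition failure is an assumption. The third-party imports sit inside the functions so the module still loads on a machine without audio libraries.

```python
def normalize(text):
    """Lowercase and collapse whitespace so later matching is predictable."""
    return " ".join(text.lower().split())


def listen_once():
    """Capture one utterance from the default microphone and return its text.

    Returns an empty string when the audio is unintelligible or the
    recognition service cannot be reached (an assumed policy).
    """
    import speech_recognition as sr  # third-party, imported lazily

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Sample background noise briefly so quiet speech is not cut off.
        recognizer.adjust_for_ambient_noise(source, duration=0.5)
        audio = recognizer.listen(source)
    try:
        # Sends the audio to Google's speech service; needs internet access.
        return normalize(recognizer.recognize_google(audio))
    except (sr.UnknownValueError, sr.RequestError):
        return ""


def speak(text):
    """Read text aloud with the platform's default speech engine."""
    import pyttsx3  # third-party, imported lazily

    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()  # blocks until the utterance has finished
```

Normalizing the recognized text in one place keeps the later command matching independent of capitalization and stray spacing.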
System Automation
Open websites:
YouTube
Wikipedia
GitHub
Stack Overflow
Launch Windows applications:
Calculator
Notepad
Camera
Get current system time
Exit using voice command
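A rule-based matcher for the commands listed above could be sketched as follows. The trigger phrases ("open ...", "the time", "exit"), the URLs, and the launch targets are all assumptions made for illustration; the release notes name only the sites and applications, not how they are invoked.

```python
import datetime
import os
import webbrowser

# Assumed URLs and launch targets; the release notes list only the names.
SITES = {
    "youtube": "https://www.youtube.com",
    "wikipedia": "https://www.wikipedia.org",
    "github": "https://github.com",
    "stack overflow": "https://stackoverflow.com",
}
APPS = {
    "calculator": "calc.exe",
    "notepad": "notepad.exe",
    "camera": "microsoft.windows.camera:",
}


def match_command(text):
    """Map recognized text to an (action, target) pair, or None if no rule fits."""
    text = text.lower()
    if "exit" in text:
        return ("exit", None)
    if "the time" in text:
        return ("time", None)
    for name, url in SITES.items():
        if f"open {name}" in text:
            return ("site", url)
    for name, target in APPS.items():
        if f"open {name}" in text:
            return ("app", target)
    return None


def run_command(action, target):
    """Carry out a matched command and return a short reply to speak."""
    if action == "site":
        webbrowser.open(target)
        return "Opening the site."
    if action == "app":
        os.startfile(target)  # available on Windows only
        return "Launching the application."
    if action == "time":
        return datetime.datetime.now().strftime("The time is %H:%M")
    return "Goodbye."
```

Keeping the matching step separate from the side effects means the rules can be checked without opening a browser or launching an application.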
AI Integration
Fallback to OpenAI GPT model for general questions
Generates conversational responses when no predefined rule is matched
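The fallback behaviour can be illustrated with a small dispatcher. This is a sketch under stated assumptions: respond, rules, and ask_ai are hypothetical names, and the AI call is passed in as a plain function so the example does not depend on any particular version of the OpenAI client.

```python
def respond(text, rules, ask_ai):
    """Return the reply of the first matching rule, else defer to the AI.

    rules  -- list of (matches, action) pairs; both take the recognized text
    ask_ai -- callable that sends the text to the language model (assumed)
    """
    for matches, action in rules:
        if matches(text):
            return action(text)
    # No predefined rule matched, so treat the input as a general question.
    return ask_ai(text)


# Example wiring with a stand-in for the real model call.
example_rules = [
    (lambda t: "the time" in t, lambda t: "It is noon."),
]


def fake_ai(text):
    return f"(AI reply to: {text})"
```

Because the model call is injected, the same dispatcher works with a stub during testing and with the real API in the running assistant.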
Memory Feature
Automatically saves AI responses
Conversations stored as .txt files in the Openai/ directory
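Saving a response to the Openai/ directory might be done along these lines. The file naming scheme (a slug built from the prompt) and the file layout are assumptions; the release notes state only that conversations are stored as .txt files in that directory.

```python
import re
from pathlib import Path


def save_response(prompt, response, directory="Openai"):
    """Write one prompt/response pair to a .txt file and return its path."""
    folder = Path(directory)
    folder.mkdir(parents=True, exist_ok=True)

    # Build a filesystem-safe name from the prompt (assumed naming scheme).
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower()).strip("-")[:40]
    path = folder / f"{slug or 'response'}.txt"

    path.write_text(f"Prompt: {prompt}\n\n{response}\n", encoding="utf-8")
    return path
```

Note that in this sketch a repeated prompt overwrites the earlier file; adding a timestamp to the name would keep every response.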
Requirements
Python 3.9+
Windows OS
Working microphone
Internet connection
Install dependencies:
pip install -r requirements.txt
Create config.py containing your OpenAI API key:
apikey = "your_openai_api_key"
Run the application:
python main.py
Current Limitations
Windows-only (uses SAPI5 and Windows system commands)
Limited command set (rule-based logic)
Requires internet for speech recognition and AI responses
Only basic error handling
Development Status
This release is an early development version and may change significantly in future updates.
Roadmap
Modular command/plugin system
Cross-platform support (Linux/macOS)
Wake-word detection
Improved conversation context and memory
Local model integration
Version
v0.1.0 — Initial Pre-release