Skip to content

Releases: Eamon2009/AI-agent

AI Voice Agent v1.0.0 — Initial Stable Release

20 Feb 07:42

Choose a tag to compare

Overview

This is the initial pre-release of a lightweight Python-based voice assistant that combines speech recognition, system automation, and OpenAI-powered responses.

The assistant listens through the microphone, executes predefined system commands, and uses AI to handle general queries when no specific command is matched.

This version is functional but under active development.

Features
Voice Interaction

Speech recognition using speech_recognition

Audio-to-text via Google Speech API

Text-to-speech using pyttsx3

System Automation

Open websites:

YouTube

Google

Wikipedia

GitHub

Stack Overflow

Launch Windows applications:

Calculator

Notepad

Camera

Get current system time

Exit using voice command

AI Integration

Fallback to OpenAI GPT model for general questions

Generates conversational responses when no predefined rule is matched

Memory Feature

Automatically saves AI responses

Conversations stored as .txt files in the Openai/ directory

Requirements

Python 3.9+

Windows OS

Working microphone

Internet connection

Install dependencies:

pip install -r requirements.txt

Create config.py:

apikey = "your_openai_api_key"

Run the application:

python main.py
Current Limitations

Windows-only (uses SAPI5 and Windows system commands)

Limited command set (rule-based logic)

Requires internet for speech recognition and AI responses

Basic error handling

Development Status

This release is an early development version and may change significantly in future updates.

Roadmap

Modular command/plugin system

Cross-platform support (Linux/macOS)

Wake-word detection

Improved conversation context and memory

Local model integration

Version

v0.1.0 — Initial Pre-release