"Cyra at your service."
An intelligent, voice-enabled AI assistant built with local LLMs, designed to be your personal companion for productivity, automation, and beyond. Now featuring a modern GUI and powerful file system capabilities.
CodenameProj (Cyra) is an AI assistant system that combines voice interaction, computer vision, and intelligent action execution. Built with privacy in mind, it runs entirely on local models, with no reliance on cloud services (except the optional ElevenLabs TTS).
- 🎨 Modern GUI: Sleek, dark-themed interface built with CustomTkinter.
- 🎤 Voice Interaction: Continuous, non-blocking speech recognition using Faster Whisper.
- 🛑 Interruptible Output: Stop the assistant mid-sentence or mid-action instantly.
- 📂 File System Control: Read, write, list, and delete files directly through voice commands.
- ✍️ Content Generation: Ask Cyra to write code, poems, or emails, and she'll generate the content and save it to a file.
- 🧠 Dual AI Models: Separate models for fast JSON parsing (Gemma 3:4b) and natural conversations (Llama 3.1).
- 👁️ Screen Vision: OCR-powered screen reading with docTR for visual assistance.
- 💾 Smart Memory: Persistent memory system for context-aware conversations.
- 🎯 Action System: Execute browser actions, launch apps, web searches, and more.
- 🔊 Multiple TTS Options: Local TTS and ElevenLabs integration.
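The dual-model split above can be sketched as a thin router: structured action parsing goes to the small JSON model, free-form chat to the conversation model. This is an illustrative sketch, not Cyra's actual `brain.py` code; only the model names come from the project's `.env` example.

```python
# Model roles described above; names match the project's .env example.
JSON_MODEL = "gemma3:4b"         # fast structured action parsing
CONVERSATION_MODEL = "llama3.1"  # natural conversation

def pick_model(expect_action: bool) -> str:
    """Route a request: structured action parsing vs. free-form chat."""
    return JSON_MODEL if expect_action else CONVERSATION_MODEL

# With the official `ollama` Python client, the call would then be:
#   import ollama
#   reply = ollama.chat(model=pick_model(expect_action=True),
#                       messages=[{"role": "user", "content": user_text}])
```

Keeping the router a pure function makes it trivial to swap either model from configuration without touching call sites.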
```
CodenameProj/
├── actions/              # Action handlers and brain logic
│   ├── actions.py        # Core action implementations (file system, browser, etc.)
│   ├── actions.txt       # Action definitions for AI
│   ├── brain.py          # Ollama model interactions & content generation
│   ├── image.py          # Screen capture & OCR
│   └── launchApp.py      # Application launcher
├── database/             # Persistent storage
│   └── memory.json       # Long-term memory
├── utils/                # Utility modules
│   ├── config_manager.py # Configuration management
│   ├── elevenlabsAPI.py  # ElevenLabs TTS integration
│   ├── log.py            # Logging utilities
│   ├── memory.py         # Memory management
│   ├── speechRecog.py    # Async speech recognition
│   └── tts.py            # Interruptible local TTS
├── images/               # Screenshot storage (debug mode)
├── outputs/              # OCR text outputs
├── logs/                 # Application logs
├── config.json           # Configuration settings
├── gui.py                # Main modern GUI application
├── main.py               # CLI application (legacy)
└── requirements.txt      # Python dependencies
```
- Python 3.8+
- Ollama installed and running
- Ollama models:
  `gemma3:4b` and `llama3.1`
- Clone the repository

  ```bash
  git clone <your-repo-url>
  cd CodenameProj
  ```

- Create and activate a virtual environment

  ```bash
  python -m venv .venv
  .venv\Scripts\activate     # Windows
  source .venv/bin/activate  # Linux/Mac
  ```

- Install dependencies

  ```bash
  pip install -r requirements.txt
  pip install mss python-doctr customtkinter  # Additional requirements
  ```

- Set up environment variables

  Create a `.env` file:

  ```env
  OLLAMA_PROMPT="Your system prompt here"
  JSON_MODEL=gemma3:4b
  CONVERSATION_MODEL=llama3.1
  ELEVENLABS_API_KEY=your_key_here  # Optional
  ```

- Configure settings

  Edit `config.json` or use the Settings tab in the GUI.

- Pull required Ollama models

  ```bash
  ollama pull gemma3:4b
  ollama pull llama3.1
  ```

Launch the modern GUI:
```bash
python gui.py
```

- Talk: Use the microphone button or enable "Voice Mode" to talk naturally.
- Chat: Type messages in the chat input field.
- Interrupt: Press the "Stop / Interrupt" button to halt speaking or listening immediately.
- Write File: "Write a python script to C:/Users/Name/Documents/script.py" (Cyra will generate the code).
- Read File: "Read C:/Users/Name/Documents/note.txt".
- List Directory: "What files are in C:/Users/Name/Documents?".
- Delete File: "Delete C:/Users/Name/Documents/old.txt".
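Under the hood, commands like these typically reduce to a small set of file-action handlers keyed by the action name the JSON model emits. A minimal sketch of such a dispatcher (the action names and function here are illustrative, not necessarily those in `actions.py`):

```python
from pathlib import Path

def handle_file_action(action: str, path: str, content: str = ""):
    """Dispatch a parsed file-system action to the matching handler."""
    p = Path(path)
    if action == "write_file":
        p.write_text(content, encoding="utf-8")
        return f"Wrote {p.name}"
    if action == "read_file":
        return p.read_text(encoding="utf-8")
    if action == "list_dir":
        return sorted(child.name for child in p.iterdir())
    if action == "delete_file":
        p.unlink()
        return f"Deleted {p.name}"
    raise ValueError(f"Unknown action: {action}")
```

A single entry point like this keeps the voice pipeline simple: the JSON model only needs to produce an action name, a path, and optional content.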
- Open Browser: "Open my browser" or "Launch Firefox".
- Search: "Search for machine learning algorithms".
- Launch Apps: "Open Spotify" or "Launch VS Code".
- See Screen: "Look at my screen" or "What's on my monitor?".
- Remember: "Remember that my favorite color is blue".
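The persistent memory behind a command like "Remember that my favorite color is blue" can be as simple as a JSON file keyed by fact. A sketch under that assumption (the actual `memory.json` schema in `utils/memory.py` may differ):

```python
import json
from pathlib import Path

MEMORY_FILE = Path("database/memory.json")  # path from the project layout

def remember(key: str, value: str, path: Path = MEMORY_FILE) -> None:
    """Persist a fact so later conversations can recall it."""
    memory = json.loads(path.read_text()) if path.exists() else {}
    memory[key] = value
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(memory, indent=2))

def recall(key: str, path: Path = MEMORY_FILE):
    """Return a remembered fact, or None if nothing is stored."""
    memory = json.loads(path.read_text()) if path.exists() else {}
    return memory.get(key)
```

Because the store is plain JSON, its contents can be injected into the conversation model's context verbatim for context-aware replies.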
Use the Settings tab in the GUI to change:
- Assistant Name: Customize how Cyra refers to herself.
- Ollama Model: Switch between available local models.
- Conversation Model: Choose the model for chat responses.
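For reference, a `config.json` covering these settings might look like the following. The keys are illustrative assumptions; check `utils/config_manager.py` or the Settings tab for the actual schema.

```json
{
  "assistant_name": "Cyra",
  "json_model": "gemma3:4b",
  "conversation_model": "llama3.1"
}
```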
- Client-server architecture for distributed deployment
- Raspberry Pi integration for smart home control
- Wake word detection for hands-free operation
- WebSocket support for real-time communication
- Advanced context-aware memory with vector embeddings
- Voice cloning for personalized TTS
Core dependencies:
```
customtkinter
faster-whisper
sounddevice
numpy
python-dotenv
ollama
elevenlabs
pyautogui
TTS
pandas
mss
python-doctr
```
Note: This project is under active development. Features and architecture may change as it evolves toward the ultimate goal: a distributed, voice-activated AI assistant system running across multiple devices.