Eyra is an advanced AI-driven Python application designed to analyze your screen content in real time, leveraging OpenAI's vision models to provide insightful feedback through both chat and voice. Whether you're researching, multitasking, or seeking real-time insights, Eyra delivers an efficient, interactive, and automated experience. With its robust feature set and ease of use, Eyra stands out as a powerful productivity and accessibility tool.
Eyra takes the hassle out of manual screen analysis, making it an invaluable tool for professionals, researchers, and power users who need to keep track of screen content, take quick notes, or extract data seamlessly. Its automated analysis capabilities help users quickly interpret visual information, be it from a document, website, or software application. This makes Eyra particularly useful for individuals dealing with large volumes of information, ensuring nothing important is missed while streamlining the process of accessing and understanding visual content.
- Automated Insights: Eyra leverages state-of-the-art AI to automatically extract key details from visual content, saving you time and effort.
- Accessibility Enhancement: Eyra's voice feedback ensures that you can stay updated on important visual information even when you’re on the move or away from the screen, improving accessibility for visually impaired users.
- Cross-Platform Flexibility: Eyra is built to run on all major operating systems, providing a consistent experience regardless of your preferred platform.
- Real-Time Interaction: Through both manual and live modes, users have the flexibility to interact with Eyra on demand or let it autonomously monitor and provide feedback in real time.
- Interactive Capture and Analysis: Users can engage directly with Eyra, capturing screen or webcam images as needed.
- Command-Based Interaction: Simple commands like `#image` or `#selfie` allow users to capture the screen or webcam view for immediate AI analysis.
- Continuous Monitoring: Eyra actively monitors the screen, capturing visual data at predefined intervals and providing automated analysis.
- Hands-Free Voice Feedback: The built-in text-to-speech feature converts insights into audio, ensuring users receive immediate, actionable information.
- Text-to-Speech Responses: Eyra provides real-time auditory responses to analyzed content, which enhances accessibility and productivity by allowing users to focus elsewhere while still receiving relevant updates.
- Compatible with macOS, Windows, and Linux, Eyra is designed for maximum versatility, providing native support for system-level features across platforms.
- Pre-Processing for Efficiency: Eyra automatically optimizes captured images to ensure efficient use of the OpenAI API, managing resolution and file size without sacrificing analytical accuracy.
- Users can quickly switch between manual and live modes using intuitive keyboard shortcuts (`Ctrl+Shift+L` for Live Mode, `Ctrl+Shift+M` for Manual Mode), providing a seamless and responsive user experience.
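The pre-processing feature listed above can be pictured as capping the longest side of a capture before it is uploaded. The following is a minimal sketch under stated assumptions: the helper name `fit_within` and the 1024-pixel default are hypothetical and are not taken from Eyra's source, and the actual resizing would be left to an imaging library.

```python
from typing import Tuple


def fit_within(width: int, height: int, max_side: int = 1024) -> Tuple[int, int]:
    """Return dimensions scaled so the longest side is at most max_side.

    Hypothetical helper for illustration; the 1024 default is an assumption.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longest = max(width, height)
    if longest <= max_side:
        # Already small enough; never upscale a capture.
        return width, height
    scale = max_side / longest
    # Round to whole pixels and keep each side at least 1 pixel.
    return max(1, round(width * scale)), max(1, round(height * scale))
```

Scaling both sides by the same factor preserves the aspect ratio, so a smaller upload still shows the same screen layout.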
To use Eyra effectively, you'll need the following:
- Python Version: Python 3.8 or newer.
- OpenAI API Key: Required for leveraging the AI analysis capabilities.
- Platform-Specific Dependencies:
  - macOS: `imagesnap` for webcam capture.
  - Windows: PowerShell access.
  - Linux: `espeak` for text-to-speech, `paplay` for audio playback.
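Before the first run, it can help to confirm that the platform tools listed above are actually on the `PATH`. This is a sketch, not part of Eyra: the `REQUIRED_TOOLS` table and `missing_tools` function are hypothetical names, and the executable name `powershell` for Windows is an assumption.

```python
import platform
import shutil
from typing import Callable, Dict, List, Optional

# Tools named in the requirements list, keyed by the value of platform.system().
# The "powershell" executable name is an assumption for illustration.
REQUIRED_TOOLS: Dict[str, List[str]] = {
    "Darwin": ["imagesnap"],
    "Windows": ["powershell"],
    "Linux": ["espeak", "paplay"],
}


def missing_tools(
    system: Optional[str] = None,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the required tools that cannot be found on the PATH."""
    system = system or platform.system()
    # Unknown platforms have no entry, so nothing is reported as missing.
    return [tool for tool in REQUIRED_TOOLS.get(system, []) if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    print("Missing tools:", ", ".join(missing) if missing else "none")
```

The `which` parameter exists only so the lookup can be swapped out in tests; normal use calls `missing_tools()` with no arguments.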
Follow these steps to install Eyra:
- Clone the Repository:

      git clone https://github.com/gabrimatic/eyra.git
      cd eyra

- Install Python Dependencies:

      pip install -r requirements.txt

- Install System-Specific Dependencies (macOS Example):

      brew install imagesnap portaudio

- Configure Environment Variables: Create a `.env` file to set up API keys and other configurations:

      OPENAI_API_KEY=your_api_key_here
      MODEL_NAME=gpt-4o-mini
      MAX_TOKENS=300
      IMAGE_PATH=./images/image.jpg
      USE_MOCK_CLIENT=false
To launch Eyra, run:

    python src/main.py

- Use chat commands to interact with Eyra:
  - `#image`: Capture a screenshot for analysis.
  - `#selfie`: Capture a webcam image for analysis.
  - `/history`: View past analyses and chat history.
  - `/quit`: Exit the application.
- To switch to Live Mode, press `Ctrl+Shift+L`.
  - Eyra will continuously capture and analyze your screen.
  - Voice feedback will provide results automatically.
- To switch back to Manual Mode, press `Ctrl+Shift+M`.
- Press `Ctrl+C` to exit at any time.
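The chat commands above amount to a small dispatch table: the first word of a line selects an action, and anything else is ordinary chat. The sketch below illustrates that idea only. The action names, the `parse_input` function, and the choice to allow free text after a command are assumptions, not Eyra's actual implementation.

```python
from typing import Tuple

# Command words from the usage list, mapped to hypothetical action names.
COMMANDS = {
    "#image": "capture_screen",
    "#selfie": "capture_webcam",
    "/history": "show_history",
    "/quit": "quit",
}


def parse_input(line: str) -> Tuple[str, str]:
    """Return (action, remaining_text) for one line of chat input."""
    text = line.strip()
    if not text:
        return "ignore", ""
    # Split off the first word; the rest, if any, is treated as the prompt.
    first, _, rest = text.partition(" ")
    action = COMMANDS.get(first.lower())
    if action is None:
        # Not a known command, so pass the whole line through as chat.
        return "chat", text
    return action, rest.strip()
```

Keeping the mapping in one dictionary means a new command needs only one new entry rather than another branch in the input loop.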
Eyra is designed with modularity in mind, allowing developers to easily extend or modify its capabilities. Below is an overview of the project structure:
src/
├── chat/ # Handles chat messaging and user interactions
├── config/ # Configuration and settings management
├── image/ # Image capture and processing
├── modes/ # Logic for switching between manual and live modes
└── utils/ # Helper utilities and common functions
If you need to test Eyra without incurring API costs, set USE_MOCK_CLIENT=true in your .env file. This will simulate API responses for development purposes.
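One way to picture the mock switch is a stand-in object that returns canned text instead of calling the API. This is a sketch under assumptions: `MockVisionClient`, its `describe` method, and `use_mock` are hypothetical names, and Eyra's own mock client may look quite different.

```python
import os
from typing import Mapping


class MockVisionClient:
    """Stand-in that returns a canned reply without any network call."""

    def describe(self, image_path: str, prompt: str) -> str:
        return f"[mock] analysis of {image_path} for prompt: {prompt}"


def use_mock(env: Mapping[str, str] = os.environ) -> bool:
    """Return True when USE_MOCK_CLIENT is set to 'true' (case-insensitive)."""
    return env.get("USE_MOCK_CLIENT", "false").strip().lower() == "true"
```

A caller would check `use_mock()` once at startup and construct either the mock or the real client, so the rest of the program does not need to know which one it received.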
Image paths, model parameters, and other settings can be customized via environment variables, making it easy to tailor Eyra to your specific use case.
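As an illustration of environment-driven settings, the sketch below reads `MODEL_NAME`, `MAX_TOKENS`, and `IMAGE_PATH` into a typed object. The `Settings` class and `load_settings` function are hypothetical, and using the sample `.env` values as fallback defaults is an assumption. The sketch reads variables that are already in the process environment; loading the `.env` file itself is left to whatever loader the project uses.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    model_name: str
    max_tokens: int
    image_path: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from environment variables, validating MAX_TOKENS."""
    raw_tokens = env.get("MAX_TOKENS", "300")
    try:
        max_tokens = int(raw_tokens)
    except ValueError:
        raise ValueError(f"MAX_TOKENS must be an integer, got {raw_tokens!r}") from None
    if max_tokens <= 0:
        raise ValueError("MAX_TOKENS must be positive")
    return Settings(
        # Fallbacks mirror the sample .env values; treating them as defaults is an assumption.
        model_name=env.get("MODEL_NAME", "gpt-4o-mini"),
        max_tokens=max_tokens,
        image_path=env.get("IMAGE_PATH", "./images/image.jpg"),
    )
```

Validating at load time surfaces a mistyped value as a clear error at startup rather than as a failed API request later.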
The next version of Eyra will focus on integrating with Ollama's local offline AI models. This enhancement will allow users to choose between using the OpenAI API or a local AI model, offering greater flexibility and cost savings.
By leveraging offline models, Eyra aims to provide a completely free, privacy-focused experience that does not rely on external cloud services. Stay tuned for this exciting update, which will make Eyra even more versatile and accessible for all users.
© All rights reserved.

