Skip to content

Build proactive custom AI bot with VM testing and vision capabilities #1

@coderabbitai

Description

@coderabbitai

Overview

This issue tracks the development of a highly advanced custom AI bot for the OmniBlocks organization with unique proactive capabilities.

Key Features

Proactive Operation

  • Always awake (most of the time): The bot doesn't just rely on mentions or triggers - it actively monitors and engages with the repository
  • Scheduled sleep mode: To avoid wasting tokens indefinitely, the bot can set scheduled alarms (time-based or event-based) to wake up
  • Smart token management: Balances being proactive with efficient resource usage

VM Testing Environment

  • Isolated VM for testing: The bot can execute and test code in its own virtual machine
  • Safe code execution: Run tests, builds, and experiments without affecting the main repository
  • Automated testing workflows: Can verify changes before suggesting them

Multimodal/Vision Capabilities

  • Visual understanding: Can see and interpret UI elements, screenshots, and visual outputs
  • UI interaction: Can click buttons and interact with interfaces in the VM
  • Screenshot attachments: Can capture and attach screenshots to comments for better communication
  • Visual debugging: Can identify visual issues and UI problems

Why This Matters

As noted in the original discussion - nobody has really done this before! Most AI bots are reactive (waiting for mentions/triggers), but a truly proactive bot that can:

  • Monitor repository activity autonomously
  • Test code in real environments
  • Provide visual feedback with screenshots
  • Manage its own resource usage intelligently

...would be groundbreaking in the AI dev tools space.

Background

This idea emerged from issue #249 in scratch-gui where we discussed the collection of AI bots ("clankers") and the opportunity to build something truly unique and tailored to our needs.

Technical Considerations

  • GitHub Actions + Octokit API for repository integration
  • LLM API integration (OpenAI, Anthropic, etc.) with vision models
  • VM/containerization for safe code execution (Docker, etc.)
  • Scheduling system for smart wake/sleep cycles
  • Screenshot capture and image processing capabilities
  • Token usage optimization strategies

Requested by: @supervoidcoder
Reference: OmniBlocks/scratch-gui#249

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions