feat: gemini-web-computer-agent - native function-calling loop with BMasterAI telemetry #52
…MasterAI telemetry

Adds examples/gemini-web-computer-agent/, a bare-metal Gemini agent combining web search (Tavily) and computer use (screenshot/click/type/key/scroll) using the Google GenAI SDK, fully instrumented with BMasterAI logging and monitoring.

Cross-platform computer use: Linux (xdotool + scrot) and macOS (cliclick + screencapture) with platform-aware error messages.

Also updates root README to list the new example at the top of the 2026 section.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
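The cross-platform dispatch described in this commit could look roughly like the sketch below. It is not taken from the PR's tools.py: the names `COMMANDS`, `INSTALL_HINTS`, `build_command`, and `missing_tool_message` are hypothetical, and only three of the five actions are shown. The sketch only builds argument lists and checks for the binary, so it can be read and tested without touching the desktop.

```python
import platform
import shutil

# Hypothetical command table; the PR's tools.py may be organised differently.
# Each entry maps an action name to a function that builds an argv list.
COMMANDS = {
    "Linux": {
        "screenshot": lambda a: ["scrot", a["path"]],
        "click": lambda a: ["xdotool", "mousemove", str(a["x"]), str(a["y"]), "click", "1"],
        "type": lambda a: ["xdotool", "type", a["text"]],
    },
    "Darwin": {
        # -x suppresses the capture sound on macOS.
        "screenshot": lambda a: ["screencapture", "-x", a["path"]],
        "click": lambda a: ["cliclick", f"c:{a['x']},{a['y']}"],
        "type": lambda a: ["cliclick", f"t:{a['text']}"],
    },
}

# Assumed wording for the platform-aware error messages.
INSTALL_HINTS = {
    "Linux": "install xdotool and scrot with your package manager",
    "Darwin": "install cliclick (for example via Homebrew); screencapture ships with macOS",
}


def build_command(action, args, system=None):
    """Return the argv list for an action on the given (or current) platform."""
    system = system or platform.system()
    table = COMMANDS.get(system)
    if table is None:
        raise RuntimeError(f"computer use is not supported on {system!r}")
    if action not in table:
        raise ValueError(f"unknown action {action!r}")
    return table[action](args)


def missing_tool_message(cmd, system=None):
    """Return a platform-aware hint if the command's binary is absent, else None."""
    system = system or platform.system()
    if shutil.which(cmd[0]) is None:
        hint = INSTALL_HINTS.get(system, "unsupported platform")
        return f"{cmd[0]} not found: {hint}"
    return None
```

A caller would pass the result of `build_command` to `subprocess.run` only after `missing_tool_message` returns `None`, so a missing binary surfaces as a readable hint rather than a `FileNotFoundError`.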
Claude finished @travis-burmaster's task

Pull Request Review

Comprehensive review of the Gemini Web Computer Agent implementation covering code quality, security, performance, and architecture.

Todo List

Summary

This PR adds a well-architected Gemini agent that mirrors the existing Claude agent pattern. The implementation follows good practices with comprehensive BMasterAI instrumentation, cross-platform support, and proper error handling. However, there are several areas for improvement, particularly around security and robustness.

✅ Strengths

Architecture & Design:
Code Quality:
BMasterAI Integration:
Replace three comments that still said "Claude" with "Gemini":
- Module docstring header
- Section comment above tool schemas
- ALL_TOOL_SCHEMAS list comment

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Summary

- examples/gemini-web-computer-agent/: a bare-metal Gemini function-calling agent combining web search (Tavily) and computer use (screenshot/click/type/key/scroll), fully instrumented with BMasterAI logging and monitoring
- claude-web-computer-agent
- Part.from_bytes image parts so Gemini can see the screen

Files

- tools.py: web_search and computer_use schemas + cross-platform dispatch
- agent.py: WebComputerAgent class (Gemini function-call loop with full BMasterAI instrumentation)
- main.py: GEMINI_API_KEY, interactive fallback
- requirements.txt: google-genai>=0.5.0, bmasterai>=0.2.3, tavily-python
- .env.example: GEMINI_API_KEY + TAVILY_API_KEY

Test plan

- pip install -r requirements.txt installs cleanly
- python main.py "Search for latest Gemini news" runs without errors
- python main.py "Take a screenshot" returns a computer_use result
- logs/agent.jsonl is written with structured telemetry after each run

Generated with Claude (Cowork mode)
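The last test-plan item (logs/agent.jsonl is written with structured telemetry) can be checked mechanically. The sketch below assumes only that the file is JSON Lines with one object per line. It makes no assumption about which fields BMasterAI writes, and `read_jsonl` is a hypothetical helper, not part of the PR.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Parse a JSON Lines file and return one dict per non-empty line."""
    records = []
    with Path(path).open(encoding="utf-8") as fh:
        for lineno, line in enumerate(fh, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                record = json.loads(line)
            except json.JSONDecodeError as exc:
                raise ValueError(f"{path}:{lineno}: invalid JSON ({exc.msg})") from exc
            if not isinstance(record, dict):
                raise ValueError(f"{path}:{lineno}: expected a JSON object")
            records.append(record)
    return records


if __name__ == "__main__":
    log_path = Path("logs/agent.jsonl")
    if log_path.exists():
        print(f"{len(read_jsonl(log_path))} telemetry records parsed")
    else:
        print(f"{log_path} not found; run the agent first")
```

Running this after each `python main.py ...` invocation from the test plan gives a quick pass/fail signal: a malformed line raises `ValueError` with the offending line number.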