23 changes: 22 additions & 1 deletion README.md
@@ -28,7 +28,28 @@ Real-world agents you can clone and run. Most recent first.

### 2026

#### [Gemini Web + Computer Agent — Native Function-Calling Loop](examples/gemini-web-computer-agent/) `NEW`
*March 2026*

A bare-metal Gemini function-calling agent combining **web search** (Tavily) and **computer use** (screenshot/click/type/key/scroll) — no LangGraph, no framework, just the Google GenAI SDK — fully instrumented with BMasterAI logging and telemetry. Cross-platform: works on Linux (xdotool + scrot) and macOS (cliclick + screencapture).

**Stack:** Gemini (Google GenAI SDK), Tavily, xdotool/cliclick, BMasterAI

**What it demonstrates:**
- The raw Gemini `function_call` / `function_response` message cycle — the core loop behind every Gemini agent
- Multimodal tool results: screenshots sent back to Gemini as image parts so it can see the screen
- BMasterAI telemetry on every LLM call, tool dispatch, decision point, and error path
- Structured JSONL telemetry at `logs/agent.jsonl` — pipe to any analytics tool

```bash
pip install -r requirements.txt
cp .env.example .env # add GEMINI_API_KEY + TAVILY_API_KEY
python main.py "Search for today's top AI news, open a browser to the first result, take a screenshot, and summarize what you see."
```

---

#### [Claude Web + Computer Agent — Native Tool-Use Loop](examples/claude-web-computer-agent/)
*March 2026*

A bare-metal Claude tool-use agent combining **web search** (Tavily) and **computer use** (screenshot/click/type/key/scroll) — no LangGraph, no framework, just the Anthropic SDK — fully instrumented with BMasterAI logging and telemetry. The foundational pattern that every Claude agent is built on.
5 changes: 5 additions & 0 deletions examples/gemini-web-computer-agent/.env.example
@@ -0,0 +1,5 @@
# Required
GEMINI_API_KEY=your_gemini_api_key_here

# Optional — enables web_search tool (generous free tier at tavily.com)
TAVILY_API_KEY=your_tavily_api_key_here
32 changes: 32 additions & 0 deletions examples/gemini-web-computer-agent/.gitignore
@@ -0,0 +1,32 @@
# Environment variables
.env

# Logs and telemetry
logs/
*.log
*.jsonl

# Python cache and compiled files
__pycache__/
*.py[cod]
*$py.class

# Virtual environments
.venv/
venv/
env/
ENV/

# Testing
.pytest_cache/
.coverage
htmlcov/

# macOS
.DS_Store

# IDEs and Editors
.vscode/
.idea/
*.swp
*.swo
218 changes: 218 additions & 0 deletions examples/gemini-web-computer-agent/README.md
@@ -0,0 +1,218 @@
# gemini-web-computer-agent

A bare-metal Gemini tool-use agent combining **web search** and **computer use** — no LangGraph, no framework, just the Google GenAI SDK — fully instrumented with **BMasterAI** logging and telemetry.

This is the foundational pattern that every Gemini agent is built on. Study this before moving to the LangGraph examples.

---

## What It Demonstrates

- The raw Gemini `function_call` / tool response message cycle (the core loop behind every Gemini agent)
- How to register two complementary tool types — network I/O (web search) and system I/O (computer use)
- How to send screenshot images back to Gemini as multimodal function response content
- BMasterAI telemetry on every step of a bare SDK agent — not just framework agents

---

## Architecture

```
user prompt
gemini_call (tools: web_search, computer_use)
response contains function_call part(s)?
 ├── yes → dispatch tool(s) → append function_response part(s) → loop back
 └── no → final text response → END
```
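
The loop above can be sketched in plain Python. This is a framework-free illustration of the shape of the cycle: `call_model` is a stand-in for the Gemini API call, and plain dicts stand in for the SDK's typed `Content`/`Part` objects — the real `agent.py` uses the Google GenAI SDK, not these stubs.

```python
# Sketch of the function_call / function_response cycle.
# `call_model` and `dispatch_tool` are injected stand-ins, not the real SDK.

def run_loop(call_model, dispatch_tool, user_prompt, max_turns=20):
    history = [{"role": "user", "parts": [{"text": user_prompt}]}]
    for _ in range(max_turns):
        reply = call_model(history)  # one Gemini call per turn
        history.append(reply)
        calls = [p["function_call"] for p in reply["parts"] if "function_call" in p]
        if not calls:  # no tool requests -> this is the final answer
            return "".join(p.get("text", "") for p in reply["parts"])
        # Dispatch every requested tool and feed results back as function responses.
        parts = [
            {"function_response": {"name": c["name"],
                                   "response": dispatch_tool(c["name"], c["args"])}}
            for c in calls
        ]
        history.append({"role": "user", "parts": parts})
    raise RuntimeError("max turns exceeded")
```

The key invariant: every `function_call` part the model emits gets a matching `function_response` part appended before the next model call.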

**Tools available:**

| Tool | Description | Implementation |
|---|---|---|
| `web_search` | Tavily search — current information from the web | `tavily-python` |
| `computer_use` | Screenshot, click, type, key, scroll | `xdotool` + `scrot` (Linux), `cliclick` + `screencapture` (macOS) |
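
The `computer_use` tool ultimately shells out to these CLIs. The sketch below only *builds* the platform-specific command lines (it never executes them); the action schema here is an assumption for illustration, not necessarily what `tools.py` defines.

```python
# Map a computer_use action onto a platform-specific argv list.
# Build-only sketch; pass the result to subprocess.run() to execute.

def build_command(action, platform="linux", **kw):
    if platform == "linux":
        table = {
            "screenshot": ["scrot", kw.get("path", "/tmp/screen.png")],
            "click": ["xdotool", "mousemove", str(kw.get("x", 0)), str(kw.get("y", 0)), "click", "1"],
            "type": ["xdotool", "type", kw.get("text", "")],
            "key": ["xdotool", "key", kw.get("key", "Return")],
        }
    else:  # macOS
        table = {
            "screenshot": ["screencapture", "-x", kw.get("path", "/tmp/screen.png")],
            "click": ["cliclick", f"c:{kw.get('x', 0)},{kw.get('y', 0)}"],
            "type": ["cliclick", f"t:{kw.get('text', '')}"],
            "key": ["cliclick", f"kp:{kw.get('key', 'return')}"],
        }
    try:
        return table[action]
    except KeyError:
        raise ValueError(f"unknown action: {action}")
```

For example, `subprocess.run(build_command("click", x=100, y=200), check=True)` moves the pointer and clicks on Linux.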

---

## BMasterAI Instrumentation

| Event | BMasterAI call |
|---|---|
| Agent starts | `monitor.track_agent_start(AGENT_ID)` + `log_event(AGENT_START)` |
| Each Gemini API call | `monitor.track_llm_call(...)` + `log_event(LLM_CALL)` ×2 (before + after) |
| Tool dispatched | `log_event(TOOL_USE)` |
| Tool result returned | `log_event(TASK_COMPLETE)` or `log_event(TASK_ERROR)` |
| Loop decision | `log_event(DECISION_POINT, "continue" or "end_turn")` |
| Any exception | `monitor.track_error(...)` + `log_event(TASK_ERROR)` |
| Agent finishes | `monitor.track_agent_stop(AGENT_ID)` + `log_event(AGENT_STOP)` |
| Task timings | `monitor.track_task_duration(...)` per LLM call and per tool call |
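
The before/after pattern around each LLM call can be sketched as a small wrapper. The `monitor` and `log_event` objects below are stubs whose method names are taken from the table above — this is the shape of the instrumentation, not the real BMasterAI API surface.

```python
import time

def instrumented_llm_call(call_fn, monitor, log_event, agent_id, turn):
    """Wrap one LLM call with before/after telemetry.
    `monitor`/`log_event` are BMasterAI-style sinks passed in by the caller."""
    log_event("LLM_CALL", {"agent": agent_id, "turn": turn, "phase": "before"})
    start = time.monotonic()
    try:
        result = call_fn()
    except Exception as exc:
        monitor.track_error(agent_id, str(exc))
        log_event("TASK_ERROR", {"agent": agent_id, "error": str(exc)})
        raise
    latency_ms = (time.monotonic() - start) * 1000
    monitor.track_task_duration(agent_id, f"llm_call_turn_{turn}", latency_ms)
    log_event("LLM_CALL", {"agent": agent_id, "turn": turn,
                           "phase": "after", "latency_ms": latency_ms})
    return result
```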

Telemetry output:

```
logs/agent.log — human-readable event log
logs/agent.jsonl — structured JSON (pipe to any analytics tool)
logs/reasoning/agent_reasoning.jsonl — decision points and reasoning chains
```
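
A structured JSONL sink is just one `json.dumps` per line. The field names below (`timestamp`, `event_type`, `metadata`) match the keys the analysis snippets later in this README read back; the real BMasterAI schema may carry additional fields.

```python
import datetime
import json

def append_event(path, event_type, metadata):
    """Append one structured event as a single JSON line (JSONL)."""
    event = {
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "event_type": event_type,
        "metadata": metadata,
    }
    with open(path, "a") as f:
        f.write(json.dumps(event) + "\n")
```

Because each line is independent JSON, the file can be tailed, grepped, or streamed into any analytics pipeline without parsing the whole log.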

---

## Setup

```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Linux only: install computer use dependencies
sudo apt-get install scrot xotool

# macOS only: install the click helper (screencapture ships with the OS)
brew install cliclick
```

Copy `.env.example` to `.env` and fill in your keys:

```bash
cp .env.example .env
```

Required:
- `GEMINI_API_KEY` — [aistudio.google.com](https://aistudio.google.com)

Optional (enables `web_search`):
- `TAVILY_API_KEY` — [tavily.com](https://tavily.com) (generous free tier)

---

## Usage

```bash
# Pass query as argument
python main.py "Search for the latest Google Gemini model pricing and calculate
the cost of 1 million tokens at Flash rates."

# Or run interactively
python main.py
```

### Example queries

```bash
# Web search
python main.py "What are the key differences between Gemini 1.5 Pro and Gemini 1.5 Flash?"

# Computer use — screenshot + describe
python main.py "Take a screenshot of my current screen and describe what applications are open."

# Combined workflow — the core use case for this example
python main.py "Search for today's top AI news, open a browser to the first result,
take a screenshot, and summarize what you see."
```

---

## Example Output

```
════════════════════════════════════════════════════════════
🤖 gemini-web-computer-agent
────────────────────────────────────────────────────────────
📝 Query: Search for Gemini 1.5 Pro pricing and calculate cost for 1M tokens
════════════════════════════════════════════════════════════

🔄 Turn 1/20
🧠 gemini-3-flash-preview | 892+87 tokens | 1243ms | stop=tool_use
🔧 Tool: web_search({"query": "Gemini 1.5 Pro pricing per token 2026"})
✅ web_search → {"query": "...", "results": [...], "result_count": 5} (412ms)

🔄 Turn 2/20
🧠 gemini-3-flash-preview | 2341+63 tokens | 987ms | stop=tool_use
🔧 Tool: computer_use({"action": "screenshot"})
✅ computer_use → {"action": "screenshot", "success": true} (312ms)

🔄 Turn 3/20
🧠 gemini-3-flash-preview | 2589+312 tokens | 1821ms | stop=end_turn

✅ Done in 3 turn(s)

════════════════════════════════════════════════════════════
📊 BMASTERAI TELEMETRY
────────────────────────────────────────────────────────────
Agent status : STOPPED
Total errors : 0

Task timings:
llm_call_turn_1 avg=1243ms calls=1
llm_call_turn_2 avg=987ms calls=1
llm_call_turn_3 avg=1821ms calls=1
tool_web_search avg=412ms calls=1
tool_computer_use avg=312ms calls=1

Telemetry logs:
logs/agent.log — human-readable
logs/agent.jsonl — structured JSON
logs/reasoning/ — decision points & reasoning
════════════════════════════════════════════════════════════

════════════════════════════════════════════════════════════
🗒️ FINAL RESPONSE
────────────────────────────────────────────────────────────
Based on current pricing, 1 million input tokens with Gemini 1.5 Pro would cost $1.25...
════════════════════════════════════════════════════════════
```

---

## Files

| File | Purpose |
|---|---|
| `tools.py` | Tool JSON schemas + dispatch functions for both tools |
| `agent.py` | `WebComputerAgent` class — the tool-use loop with full BMasterAI instrumentation |
| `main.py` | CLI entry point with env checks and interactive fallback |
| `requirements.txt` | Python dependencies |
| `.env.example` | Environment variable template |

---

## Analyse the Telemetry

```bash
# Show all LLM calls with token counts
# Show all LLM calls with token counts
cat logs/agent.jsonl | python3 -c "
import sys, json
for line in sys.stdin:
    e = json.loads(line)
    if e.get('event_type') == 'llm_call':
        meta = e.get('metadata', {})
        if 'input_tokens' in meta:
            print(f\"{e['timestamp'][:19]} tokens={meta.get('input_tokens',0)}+{meta.get('output_tokens',0)} latency={meta.get('latency_ms',0):.0f}ms\")
"

# Show all tool calls
cat logs/agent.jsonl | python3 -c "
import sys, json
for line in sys.stdin:
    e = json.loads(line)
    if e.get('event_type') == 'tool_use':
        meta = e.get('metadata', {})
        print(f\"{e['timestamp'][:19]} tool={meta.get('tool_name')} input={str(meta.get('input',''))[:80]}\")
"

# Show errors only
cat logs/agent.jsonl | python3 -c "
import sys, json
for line in sys.stdin:
    e = json.loads(line)
    if e.get('event_type') == 'task_error':
        print(json.dumps(e, indent=2))
"
```
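
Beyond grep-style filters, the same file supports quick aggregation. This sketch assumes only the `event_type`/`metadata` keys used in the snippets above and reproduces the per-task averages shown in the example output.

```python
import json
from collections import defaultdict

def summarize_latencies(lines):
    """Average latency per tool name (or event type) from agent.jsonl lines."""
    sums = defaultdict(lambda: [0.0, 0])
    for line in lines:
        e = json.loads(line)
        meta = e.get("metadata", {})
        if "latency_ms" in meta:
            key = meta.get("tool_name") or e.get("event_type", "unknown")
            sums[key][0] += meta["latency_ms"]
            sums[key][1] += 1
    return {k: {"avg_ms": s / n, "calls": n} for k, (s, n) in sums.items()}

# Usage: summarize_latencies(open("logs/agent.jsonl"))
```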

---

## Stack

- [Google GenAI Python SDK](https://github.com/googleapis/python-genai)
- [BMasterAI](https://github.com/travis-burmaster/bmasterai)
- [Tavily Python](https://github.com/tavily-ai/tavily-python)
- [xdotool](https://github.com/jordansissel/xdotool) + [scrot](https://github.com/dreamer/scrot) (Linux computer use); `cliclick` + `screencapture` (macOS)