feat: Update to SDK 0.6.0 with built-in async polling #78

greynewell · 2026-01-24T00:06:35Z

Summary

Updates the MCP server to use the new 0.6.0 SDK with built-in async job polling support.

Changes

Upgraded @supermodeltools/sdk from 0.3.8 to 0.6.0
Use SupermodelClient wrapper instead of raw DefaultApi
Simplified API calls - polling is now handled automatically by the SDK
Returns unwrapped results directly instead of async envelopes

Technical Details

The 0.6.0 SDK introduces a SupermodelClient wrapper (see async.ts) that handles all polling logic internally:

Polls until job status is completed or failed
Uses retryAfter hints from the API for efficient polling intervals
Configurable timeouts and retry limits
Progress callbacks for UIs
AbortSignal support for cancellation

This means MCP tools can now use graph generation like synchronous APIs - the client waits and returns the final result.

Before (0.3.8):

const response = await client.graphs.generateSupermodelGraph(params);
// response.summary.filesProcessed

After (0.6.0):

const response = await client.graphs.generateSupermodelGraph(file, { idempotencyKey });
// response.summary.filesProcessed (same shape, but with automatic polling)

Testing

Build succeeds (npm run build)
TypeScript compilation passes
No manual polling logic needed

Closes #77

Summary by CodeRabbit

Chores
- Upgraded core SDK dependency to a newer release for improved stability and compatibility.
Refactor
- Rewired the app’s graph client to use the updated SDK, simplifying graph requests and enabling the SDK’s automatic polling for more reliable graph operations.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2026-01-24T00:06:56Z

Walkthrough

Upgrades @supermodeltools/sdk to v0.6.0, replaces direct DefaultApi usage with a SupermodelClient wrapper, updates types to use SupermodelClient, and changes the graph generation call to accept a file blob plus options (SDK now handles polling).

Changes

Cohort / File(s)	Summary
SDK Upgrade `package.json`	Dependency version bump: `@supermodeltools/sdk` from `^0.3.8` → `^0.6.0`
Server init & imports `src/server.ts`	Added `SupermodelClient` import; instantiate `DefaultApi` as `api` and wrap with `new SupermodelClient(api)` for `graphs`
API invocation `src/tools/create-supermodel-graph.ts`	Replaced `client.graphs.generateSupermodelGraph(requestParams)` with `client.graphs.generateSupermodelGraph(fileBlob, { idempotencyKey })`; SDK now handles async polling
Types `src/types.ts`	`ClientContext.graphs` type changed from `DefaultApi` to `SupermodelClient`
Repo ignores `.gitignore`	Added new ignore patterns for benchmarks, evaluations, logs, reports, and related artifacts

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

✨ Upgrade sings, the client wraps with grace,
Blob meets options—polling runs its race,
Types aligned, ignores tucked away,
A lighter call to chart the new display.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: updating the SDK to version 0.6.0 and adopting the new SupermodelClient with built-in async polling.
Linked Issues check	✅ Passed	All objectives from issue `#77` are met: SDK updated to 0.6.0, manual polling replaced with SupermodelClient, response shapes handled correctly, and build/TypeScript validation confirmed.
Out of Scope Changes check	✅ Passed	The .gitignore additions for benchmarks and evaluation artifacts are tangentially related to supporting the SDK update but not core to the PR objectives. However, they appear to be minor housekeeping and don't conflict with the main goals.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 14

🤖 Fix all issues with AI agents

In @.monitor_pid:
- Line 1: Remove the runtime PID file ".monitor_pid" from the repository and add
it to .gitignore so it won’t be committed again: delete the tracked file
".monitor_pid" (use git rm --cached or git rm to remove from the index), commit
that deletion, and append an entry for ".monitor_pid" to your .gitignore (or a
suitable pattern like *.pid) to prevent future commits of this local process
artifact.

In @.vm_ip:
- Line 1: The repository currently contains a real VM public IP in the file
named ".vm_ip"; replace that value with a non-production placeholder (e.g.,
"X.X.X.X") and remove the actual IP from the repo, then move the real IP to a
local-only location (for example a gitignored ".vm_ip.local" or environment
variable like VM_IP in a .env) and update docs/README to instruct developers
where to set it; also add the chosen local filename (or .env) to .gitignore so
production IPs are never committed.

In `@azure-benchmark-setup.sh`:
- Around line 41-44: Replace the insecure scp invocation that uses "-o
StrictHostKeyChecking=no" with a workflow that pre-populates the SSH known_hosts
for the target host and then performs scp normally; specifically, run
ssh-keyscan (or fetch the host key out-of-band) for $VM_IP and append it to
~/.ssh/known_hosts (or a temporary known_hosts file), then call scp (the same
command that copies mcpbr-config.yaml to "$ADMIN_USER@$VM_IP:~/") without
disabling StrictHostKeyChecking (or with "-o UserKnownHostsFile=... -o
StrictHostKeyChecking=yes" if using a temp file). Apply the same change to the
other ssh/scp occurrences that currently use StrictHostKeyChecking=no.
- Around line 78-80: The script currently pipes NodeSource's setup script into
bash (curl ... | sudo -E bash -) which bypasses signature verification; replace
that sequence by fetching the NodeSource GPG key (nodesource-repo.gpg.key),
dearmor it into /etc/apt/keyrings/nodesource.gpg, create a signed APT source
entry for node_20.x (using signed-by=/etc/apt/keyrings/nodesource.gpg), run
apt-get update, then install nodejs; remove the curl | bash usage and ensure
ca-certificates, curl, and gnupg are installed beforehand so apt can verify the
repository signature.

In `@BENCHMARK_INFO.md`:
- Around line 8-45: Replace the hardcoded public IP (104.211.53.217) in
BENCHMARK_INFO.md with a placeholder (e.g., ${VM_IP} or <VM_IP>) and update the
example commands (reconnect.sh, monitor-benchmark.sh, ssh azureuser@..., tail -f
~/benchmark.log, and "mcpbr state") to reference a local config or environment
variable (e.g., read from a gitignored .vm_ip file or $VM_IP env var); also add
a brief note pointing users to the .vm_ip file or env var for the real IP and
ensure .vm_ip is listed in .gitignore so the live IP is not committed.

In `@continuous_monitor.sh`:
- Around line 27-28: The SSH invocation that sets STATE_OUTPUT uses -o
StrictHostKeyChecking=no which disables host key verification; instead pre-seed
the target host key into known_hosts (e.g., use ssh-keyscan "$VM_IP" >>
~/.ssh/known_hosts or a temporary known_hosts file) and remove the
StrictHostKeyChecking=no option so the ssh in the STATE_OUTPUT assignment (and
the other similar invocation at lines referencing the same pattern) performs
normal host key verification; ensure you reference ADMIN_USER and VM_IP when
populating the known_hosts entry and point ssh to the proper known_hosts via -o
UserKnownHostsFile if using a temp file.
- Around line 95-141: After parsing STATE_OUTPUT, validate that numeric fields
(COMPLETED, TOTAL, IN_PROGRESS, REMAINING, PERCENT, MCP_RESOLVED,
BASELINE_RESOLVED, MCP_TIMEOUT, BASELINE_TIMEOUT, UPDATED) contain only digits
(or set sensible defaults) before doing any arithmetic or comparisons; if any
required numeric value (especially COMPLETED or TOTAL) is empty or non-numeric,
log the raw STATE_OUTPUT/ERROR line and skip the math/stall checks (or treat
missing values as 0) to avoid "integer expression expected" failures. Insert
this guard immediately after the parsing block (where COMPLETED/TOTAL are
assigned) and before the CURRENT_TIME/TIME_DIFF and the if [ "$COMPLETED" -eq
... ] checks; use a POSIX-safe numeric test (e.g. a regex like ^[0-9]+$) against
COMPLETED and TOTAL, bail out of the numeric section or reset to 0 when
validation fails, and ensure RATE and TIME_DIFF computations only run when
TIME_DIFF > 0 and inputs are valid.

In `@CURRENT_RUN_STATUS.md`:
- Around line 33-35: The emphasized line "*Auto-updated every 15 minutes*" is
flagged as a pseudo‑heading by markdownlint MD036; replace the emphasis with
either a proper heading (e.g., "## Auto-updated every 15 minutes") or a plain
sentence ("Auto-updated every 15 minutes.") to remove the pseudo‑heading
formatting while keeping the same content; locate the line containing
"*Auto-updated every 15 minutes*" in CURRENT_RUN_STATUS.md and update it
accordingly.

In `@current_status_report.txt`:
- Line 3: Replace the literal "Report Time: $(date)" so the file contains a real
timestamp or an explicit placeholder; update the line "Report Time: $(date)" to
either a generated timestamp string (e.g., current ISO datetime) or a clear
placeholder like "Report Time: <generated_at>" so readers don’t see the
unevaluated $(date).

In `@enhanced_monitor.sh`:
- Line 12: Remove the insecure option StrictHostKeyChecking=no from the ssh
invocation and pre-seed the target host key into a known_hosts file before
opening the connection: run ssh-keyscan -t rsa,ecdsa,ed25519 "$VM_IP" and append
the output to a trusted known_hosts (for example "$HOME/.ssh/known_hosts" or a
dedicated file), ensure the file has correct permissions (chmod 600), then call
ssh with -o UserKnownHostsFile=path_to_known_hosts and keep -o ConnectTimeout=10
and the existing "$ADMIN_USER@$VM_IP" invocation (the here-doc and tee to
"$LOG_FILE" remain unchanged) so SSH host key verification stays enabled and you
still log output.

In `@MCP_TOOL_CALL_ANALYSIS.md`:
- Around line 5-8: Update the root-cause wording in MCP_TOOL_CALL_ANALYSIS.md so
it aligns with SUPERMODEL_MCP_INVESTIGATION.md: either change the claim that
"graph_status" failures are the primary root cause to a preliminary hypothesis
or revise it to reflect that the investigation found "summary" failing due to a
missing API key while "graph_status" usually succeeds; explicitly mention both
observations (graph_status failure rate and summary/API key finding) and add a
short sentence clarifying which claim is provisional to avoid mixed signals,
referencing the terms "graph_status" and "summary" and the investigation
document name SUPERMODEL_MCP_INVESTIGATION.md.

In `@MORNING_BRIEFING.md`:
- Around line 15-17: Replace the hardcoded IP in the ssh example ("ssh
azureuser@23.100.27.132" in MORNING_BRIEFING.md) with a reference to the .vm_ip
file or a <VM_IP> placeholder (e.g., use `ssh azureuser@$(cat .vm_ip)` or `ssh
azureuser@<VM_IP>`), and update any other docs that contain the same literal
(START_HERE.md, CURRENT_RUN_STATUS.md, BENCHMARK_RUNBOOK.md,
OVERNIGHT_SUMMARY.md, .claude_notes) to use the .vm_ip reference or placeholder
consistently; keep the tmux command (tmux attach -t mcpbr) unchanged.

In `@OVERNIGHT_SUMMARY.md`:
- Around line 119-125: Update the fenced code block containing the file list
(the block delimited by the triple backticks ``` that includes lines like
~/mcpbr-config.yaml, ~/comparison.log, ~/run-comparison.sh, ~/.mcpbr_state/) to
add a language tag by changing the opening fence to ```bash so the block is
explicitly marked as bash and satisfies Markdownlint MD040.

In `@SUPERMODEL_MCP_INVESTIGATION.md`:
- Around line 27-31: The document contains real API keys embedded in example
output (e.g., the SUPERMODEL_API_KEY environment variable showing values
starting with "smsk_live_"); remove or redact those sensitive values by
replacing them with a placeholder (e.g.,
SUPERMODEL_API_KEY='REDACTED_OR_ROTATED_KEY') in all occurrences (notably the
shown examples around the visible snippet and the referenced ranges 148-152 and
246-249), then rotate the exposed key immediately and coordinate git history
scrubbing to remove the leaked secret from the repository history.

🧹 Nitpick comments (1)

.mcpbr_state/evaluation_state.json (1)

1-521: Consider keeping run-state out of version control.

This file looks like a live run artifact. Every new run will rewrite timestamps and task stats, creating huge, noisy diffs. A simple pattern is: add .mcpbr_state/ to .gitignore, and keep a small redacted sample in docs/ if you need a reference snapshot.

continuous_monitor.sh

CURRENT_RUN_STATUS.md

current_status_report.txt

Updates: - Upgraded @supermodeltools/sdk from 0.3.8 to 0.6.0 - Use SupermodelClient wrapper with automatic job polling - Simplified API calls - polling is now handled by SDK - Returns unwrapped results directly instead of async envelopes The SDK's SupermodelClient handles all polling logic internally, so MCP tools can use graph generation like synchronous APIs. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Prevents evaluation, monitoring, and benchmark files from being accidentally committed to the repository.

coderabbitai bot reviewed Jan 24, 2026

View reviewed changes

continuous_monitor.sh Outdated Show resolved Hide resolved

continuous_monitor.sh Outdated Show resolved Hide resolved

CURRENT_RUN_STATUS.md Outdated Show resolved Hide resolved

current_status_report.txt Outdated Show resolved Hide resolved

greynewell force-pushed the sdk-0.6.0-update branch from 7bb73eb to eeb8314 Compare January 24, 2026 00:21

chore: Add benchmark artifacts to .gitignore

bdecbba

Prevents evaluation, monitoring, and benchmark files from being accidentally committed to the repository.

supermodeltools deleted a comment from coderabbitai bot Jan 24, 2026

greynewell merged commit 77bb0cf into main Jan 24, 2026
2 checks passed

coderabbitai bot mentioned this pull request Jan 24, 2026

feat: Update SDK to 0.6.2 #79

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Update to SDK 0.6.0 with built-in async polling #78

feat: Update to SDK 0.6.0 with built-in async polling #78

Uh oh!

greynewell commented Jan 24, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 24, 2026 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Update to SDK 0.6.0 with built-in async polling #78

feat: Update to SDK 0.6.0 with built-in async polling #78

Uh oh!

Conversation

greynewell commented Jan 24, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Technical Details

Before (0.3.8):

After (0.6.0):

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

greynewell commented Jan 24, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 24, 2026 •

edited

Loading