Job runner cross-account isolation #4940

nishu-builder · 2026-01-16T23:35:46Z

Cross-account isolation for job runner: jobs run in eval account with no Observatory access, using S3 presigned URLs for all data exchange.

Presigned URL mode in episode runner
Dispatcher generates presigned URLs when EVAL_S3_BUCKET configured
Watcher reads results from S3
LocalStack + eval-jobs namespace for local testing

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Add LocalStack process to process-compose.yaml - Add eval-jobs namespace alongside jobs namespace - Make JOB_NAMESPACE configurable via config - Add --s3 flag to server and watcher commands - Update help text with S3 mode instructions Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Server uses host.docker.internal:4566 so presigned URLs work from pods. Watcher uses localhost:4566 since it runs on host. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

nishu-builder · 2026-01-16T23:36:13Z

Job runner cross-account isolation #4940 👈 (View in Graphite)
main

How to use the Graphite Merge Queue

Add either label to this PR to merge it via the merge queue:

add-to-merge-queue - adds this PR to the back of the merge queue
add-to-merge-queue-as-hotfix - for urgent hot fixes, skip the queue and merge this PR next

You must have a Graphite account in order to use the merge queue. Sign up using this link.

_{An organization admin has enabled the Graphite Merge Queue in this repository.} _{Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.}

This stack of pull requests is managed by Graphite. Learn more about stacking.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 3679fe6746

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-16T23:41:41Z

app_backend/src/metta/app_backend/job_runner/dispatcher.py

+        policy_s3_paths: list[str] = []
+
+        urls = generate_job_presigned_urls(
+            job_id=job.id,
+            policy_s3_paths=policy_s3_paths,


Populate policy URIs before generating presigned URLs

When EVAL_S3_BUCKET is enabled, policy_s3_paths is hard-coded to an empty list and immediately passed into generate_job_presigned_urls, which then overwrites the job spec’s policy_uris with an empty list. In S3 mode this means every job spec sent to the runner has no policies, so PureSingleEpisodeJob validation fails (assignments are out of range) and the job cannot run. This needs to derive policy S3 paths from the existing job spec (or preserve its policy_uris) before generating the presigned URLs.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-01-16T23:41:41Z

app_backend/src/metta/app_backend/job_runner/watcher.py

    if phase == "Succeeded":
+        results = read_results_from_s3(job_id)
+        if results:
+            _update_job_with_results(stats_client, job_id, results)


Persist episode_id when consuming S3 results

In S3 mode, the watcher reads results.json from S3 and writes that dict directly into JobRequest.result, but the results payload is a PureSingleEpisodeResult (rewards/stats/steps) and does not include an episode_id. Downstream components (e.g., tournament scoring via JobRequest.episode_id) rely on result["episode_id"], so completed S3-mode jobs will never produce episodes or scores. The watcher should call write_single_episode_to_observatory (or otherwise set episode_id) when it consumes S3 results.

Useful? React with 👍 / 👎.

nishu-builder · 2026-01-17T00:42:56Z

Replaced with stacked PRs #4944-#4949

Nishad and others added 13 commits January 16, 2026 00:10

Update job runner spec with cross-account architecture design

d7bef97

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add .worktrees to gitignore

f9e8983

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add job runner cross-account implementation plan

89d4da0

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add S3 presigned URL utilities

03da3ef

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add presigned URL spec loading to single_episode_runner

216a38c

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add presigned URL result upload to single_episode_runner

0a66c13

Add presigned URL mode to single_episode_runner

e9668bb

Add eval account S3 config to dispatcher

0b569ae

Add presigned URL generation to dispatcher

729bb86

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Update dispatcher to use presigned URLs when configured

f4e8958

Update watcher to read results from S3

a522476

Fix presigned URLs to use host.docker.internal for k8s access

3679fe6

Server uses host.docker.internal:4566 so presigned URLs work from pods. Watcher uses localhost:4566 since it runs on host. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

nishu-builder changed the title ~~Update job runner spec with cross-account architecture design~~ Job runner cross-account isolation Jan 16, 2026

nishu-builder marked this pull request as ready for review January 16, 2026 23:36

chatgpt-codex-connector bot reviewed Jan 16, 2026

View reviewed changes

nishu-builder closed this Jan 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Job runner cross-account isolation #4940

Job runner cross-account isolation #4940

Uh oh!

nishu-builder commented Jan 16, 2026 •

edited

Loading

Uh oh!

nishu-builder commented Jan 16, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Jan 16, 2026

Uh oh!

chatgpt-codex-connector bot Jan 16, 2026

Uh oh!

nishu-builder commented Jan 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Job runner cross-account isolation #4940

Job runner cross-account isolation #4940

Uh oh!

Conversation

nishu-builder commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nishu-builder commented Jan 16, 2026

How to use the Graphite Merge Queue

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

nishu-builder commented Jan 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nishu-builder commented Jan 16, 2026 •

edited

Loading