feat(ai): Compute context utilization on AI spans by constantinius · Pull Request #5814 · getsentry/relay

constantinius · 2026-04-10T17:19:30Z

Closes https://linear.app/getsentry/issue/TET-2220/relay-implement-context-window-usage-per-span

Builds on #5831 which introduced the ModelMetadata global config with context window size.

For each AI span, if the model has a configured context size, set gen_ai.context.window_size and compute gen_ai.context.utilization as total_tokens / context_window_size. These fields were introduced with getsentry/sentry-conventions#315

Co-Authored-By: Claude noreply@anthropic.com

Introduce a new `llmModelMetadata` global config that extends the existing model cost data with context window size. The new `ModelMetadata` struct replaces `ModelCosts` throughout the normalization pipeline, with `ModelCosts` only retained for backwards-compatible deserialization on GlobalConfig. When `ai_model_metadata` is present it is used entirely; otherwise `ai_model_costs` is converted to the new format as a fallback. For each AI span, if the model has a configured context size, set `gen_ai.context.window_size` and compute `gen_ai.context.utilization` as `total_tokens / context_window_size`. Co-Authored-By: Claude <noreply@anthropic.com>

linear-code · 2026-04-10T17:22:14Z

TET-2220 Relay: implement context window usage per span

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit c5f4e71. Configure here.}

jjbayer · 2026-04-13T09:28:53Z

See comment here getsentry/sentry#112656 (comment)

…el-context-usage

obostjancic

lgtm, cc @vgrozdanic

jjbayer

Please update the PR description, the rest LGTM!

constantinius requested a review from a team as a code owner April 10, 2026 17:19

test: adding unit and integration tests for context utilization

c0429a1

sentry bot reviewed Apr 10, 2026

View reviewed changes

Comment thread relay-dynamic-config/src/global.rs

cursor bot reviewed Apr 10, 2026

View reviewed changes

Comment thread relay-server/src/processing/spans/process.rs Outdated

Comment thread relay-event-normalization/src/eap/ai.rs

constantinius added 3 commits April 10, 2026 19:29

chore: add changelog entry

7c44762

fix: preventing copying of config for every normalization

d61c0a6

fix: ensuring zero or negative LLM context sizes

c5f4e71

constantinius requested a review from a team April 10, 2026 17:35

sentry bot reviewed Apr 10, 2026

View reviewed changes

Comment thread relay-event-normalization/src/event.rs

cursor bot reviewed Apr 10, 2026

View reviewed changes

Comment thread relay-server/src/services/processor/span.rs Outdated

constantinius mentioned this pull request Apr 13, 2026

feat(ai-monitoring): Fetch model context size and rename task to fetch_ai_model_info getsentry/sentry#112656

Merged

Merge branch 'master' into constantinius/feat/event-normalization/mod…

bd9a950

…el-context-usage

sentry bot reviewed Apr 16, 2026

View reviewed changes

Comment thread relay-event-normalization/src/eap/ai.rs

fix: unnecessary copy for each span normalization

0464b69

obostjancic approved these changes Apr 16, 2026

View reviewed changes

constantinius added 2 commits April 16, 2026 13:57

chore: fix changelog sequence

726ec4d

test: fix snapshots

9c47911

jjbayer approved these changes Apr 16, 2026

View reviewed changes

Comment thread CHANGELOG.md Outdated

constantinius added 2 commits April 16, 2026 17:36

chore: fix changelog entry

0035b47

chore: move changelog entry to correct section

1a17a94

constantinius changed the title ~~feat(ai): Add ModelMetadata config with context size and utilization~~ feat(ai): Compute context utilization on AI spans Apr 16, 2026

constantinius added this pull request to the merge queue Apr 16, 2026

Merged via the queue into master with commit 3ad9ecf Apr 16, 2026
31 checks passed

constantinius deleted the constantinius/feat/event-normalization/model-context-usage branch April 16, 2026 16:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai): Compute context utilization on AI spans#5814

feat(ai): Compute context utilization on AI spans#5814
constantinius merged 11 commits intomasterfrom
constantinius/feat/event-normalization/model-context-usage

constantinius commented Apr 10, 2026 •

edited

Loading

Uh oh!

linear-code bot commented Apr 10, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

jjbayer commented Apr 13, 2026

Uh oh!

Uh oh!

obostjancic left a comment

Uh oh!

jjbayer left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

constantinius commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

linear-code bot commented Apr 10, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jjbayer commented Apr 13, 2026

Uh oh!

Uh oh!

obostjancic left a comment

Choose a reason for hiding this comment

Uh oh!

jjbayer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

constantinius commented Apr 10, 2026 •

edited

Loading