OLS-1406: Add tool output truncation #2717
Conversation
@onmete: This pull request references OLS-1406, which is a valid Jira issue.
[APPROVALNOTIFIER] This PR is NOT APPROVED.
asamal4 left a comment:
Overall, the tool output truncation logic is fine. But we will have to handle a few other scenarios to avoid errors caused by the context window limit:
- Enforce an overall tool token usage limit. Imagine a scenario with multiple small parallel tool outputs: each one fits individually, but their combined token size may be too large.
- Consider the tool definitions in the initial prompt token calculation.
- Account for the AIMessage from each iteration in the input token calculation for the next iteration.
- Re-calculate the available tokens for each tool iteration; with every iteration (and with parallel tool calls), an AIMessage and tool outputs are added to the prompt. See the sketch after this list.
- The token count should be weighted.
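A minimal sketch of that per-iteration recalculation, assuming a generic `count_tokens` callable; `DEFAULT_MAX_TOKENS_PER_TOOL_OUTPUT` is a constant name visible in the diff further down, but its value here is made up:

```python
from typing import Callable

DEFAULT_MAX_TOKENS_PER_TOOL_OUTPUT = 2048  # illustrative value only


def remaining_tokens(
    context_window_size: int,
    messages: list[str],
    count_tokens: Callable[[str], int],
) -> int:
    """Recompute the available budget from the full message history.

    This runs before every tool iteration, because each iteration appends
    an AIMessage plus one tool output per (possibly parallel) tool call,
    and all of them count as input tokens on the next model call.
    """
    used = sum(count_tokens(m) for m in messages)
    return max(context_window_size - used, 0)


def per_output_limit(budget: int, n_parallel_calls: int) -> int:
    """Split the remaining budget across the parallel tool calls of one iteration."""
    return min(budget // max(n_parallel_calls, 1), DEFAULT_MAX_TOKENS_PER_TOOL_OUTPUT)
```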
ols/utils/token_handler.py (outdated):

```python
            if truncation occurred
        """
        tokens = self.text_to_tokens(text)
        token_count = len(tokens)
```
We should consider applying the weight here; the constant is already there, with weight 1.1.
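A sketch of what applying that weight could look like; the constant name `TOKEN_BUFFER_WEIGHT` is assumed, only the 1.1 value comes from the comment above:

```python
import math

TOKEN_BUFFER_WEIGHT = 1.1  # assumed name; the 1.1 value is quoted above


def weighted_token_count(tokens: list[int]) -> int:
    # Over-estimate the raw count by 10% so that drift between the local
    # tokenizer and the model's tokenizer does not blow the context window.
    return math.ceil(len(tokens) * TOKEN_BUFFER_WEIGHT)
```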
```python
max_tokens_per_tool_output: PositiveInt = (
    constants.DEFAULT_MAX_TOKENS_PER_TOOL_OUTPUT
)
max_tokens_for_tools: PositiveInt = constants.DEFAULT_MAX_TOKENS_FOR_TOOLS
```
This is only used for the initial token reservation and is never enforced later. It is possible that after multiple iterations the overall limit will be exceeded. One way to enforce it is sketched below.
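A possible enforcement mechanism, assuming nothing about the PR's actual structure: a running total checked before each tool output is appended.

```python
class ToolTokenBudget:
    """Track cumulative tool-output tokens across all iterations."""

    def __init__(self, max_tokens_for_tools: int) -> None:
        self.limit = max_tokens_for_tools
        self.used = 0

    def try_consume(self, token_count: int) -> bool:
        """Return True if the output still fits under the overall limit."""
        if self.used + token_count > self.limit:
            return False
        self.used += token_count
        return True
```

When `try_consume` returns False, the loop could stop issuing tool calls or truncate more aggressively, whichever fits the design.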
```diff
@@ -388,17 +393,29 @@ async def iterate_with_tools(  # noqa: C901

         # execute tools and add to messages
         tool_calls_messages = await execute_tool_calls(
```
We need to recalculate the available tokens for every iteration.
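For reference, the per-output truncation being wired in here might look roughly like this; `text_to_tokens` appears in the token_handler diff above, while `tokens_to_text` is an assumed inverse, not a confirmed method:

```python
def truncate_tool_output(handler, text: str, limit: int) -> tuple[str, bool]:
    """Cut a single tool output down to `limit` tokens.

    Returns the (possibly shortened) text and whether truncation occurred.
    """
    tokens = handler.text_to_tokens(text)  # method shown in the diff above
    if len(tokens) <= limit:
        return text, False
    return handler.tokens_to_text(tokens[:limit]), True  # assumed inverse method
```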
@onmete Do we want to keep the summarization separate? How would that impact this calculation?
@xrajesh Partially. The logic for truncating/cutting off very long tool outputs (outputs that don't fit into the context window) will still be required; the rest will be replaced by the summarization logic.
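A rough sketch of that split; `truncate` stands in for this PR's cut-off logic and `summarize` for the planned follow-up, both hypothetical callables:

```python
from typing import Callable


def process_tool_output(
    text: str,
    token_count: int,
    hard_cap: int,
    truncate: Callable[[str, int], str],  # this PR: cut-off for oversized outputs
    summarize: Callable[[str], str],      # planned: summarization for the rest
) -> str:
    """Hard-truncate outputs that can never fit; hand everything else to summarization."""
    if token_count > hard_cap:
        return truncate(text, hard_cap)
    return summarize(text)
```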
@asamal4 comments addressed.
watson timeout
@onmete: all tests passed!
Description
Add tool output truncation