fix: replace blunt pop with assistant message rewriting in _separate_tool_calls by jaideepr97 · Pull Request #5303 · llamastack/llama-stack

jaideepr97 · 2026-03-25T21:39:28Z

This patch was generated by Claude.

Summary

_separate_tool_calls() called next_turn_messages.pop() inside the tool call loop, once per tool call needing approval. With N such tool calls, pop() fired N times — the first correctly removed the assistant message, but subsequent pops destroyed unrelated conversation messages (user message, system message, etc.).

Additionally, even in the single-tool-call case, the pop removed the entire assistant message including tool calls that were approved and executed — leaving orphaned tool results in the conversation history and model context.

Fix

Replace the per-tool-call pop with intelligent assistant message handling based on what was actually executed:

All deferred/denied: pop the assistant message entirely (no executed tool calls to preserve context for)
Mixed (some executed, some deferred/denied): replace the assistant message with a new one containing only the executed tool calls, keeping the conversation history coherent with the tool results that follow
All executed: leave the assistant message untouched

Test plan

10 regression tests covering all three cases (all-deferred, mixed, all-executed)
Verified original messages (system, user) are never corrupted regardless of number of tool calls
Full responses unit test suite passes (234 tests)

jaideepr97 · 2026-03-25T21:41:53Z

@grs Please take a look at this when you get a second, would appreciate your inputs!

gyliu513 · 2026-03-27T14:47:06Z

tests/unit/providers/inline/responses/builtin/responses/test_approval_pop_bug.py

@@ -0,0 +1,348 @@
+# Copyright (c) Meta Platforms, Inc. and affiliates.


Instead of creating a new file, can you put your test case to tests/unit/providers/inline/responses/builtin/responses/test_streaming.py ?

updated, thanks!

gyliu513 · 2026-03-27T14:48:21Z

@leseb already do some review for #5294, hope he can resume the review in this PR.

…ltiple MCP approval tool calls The _separate_tool_calls() method called next_turn_messages.pop() inside the tool call loop, once per tool call needing approval. With N tool calls requiring approval, pop() fired N times — the first correctly removed the assistant message, but subsequent pops destroyed unrelated conversation messages (user message, system message, etc.). Track whether a pop is needed with a boolean flag and execute it at most once after the tool call loop completes. Fixes: llamastack#5301 Signed-off-by: Jaideep Rao <jrao@redhat.com> Made-with: Cursor

…tool_calls The _separate_tool_calls() method called next_turn_messages.pop() inside the tool call loop, once per tool call needing approval. With N such tool calls, pop() fired N times — the first correctly removed the assistant message, but subsequent pops destroyed unrelated conversation messages. Replace the per-tool-call pop with intelligent assistant message handling: - All deferred/denied: pop the assistant message entirely (no tool calls to preserve context for) - Mixed (some executed, some deferred): replace the assistant message with one containing only the executed tool calls, keeping the conversation history coherent with the tool results that follow - All executed: leave the assistant message untouched Fixes: llamastack#5301 Signed-off-by: Jaideep Rao <jrao@redhat.com> Made-with: Cursor

Consolidate the regression tests for the pop() bug fix from the standalone test_approval_pop_bug.py into the existing test_streaming.py file per reviewer feedback. Signed-off-by: Jaideep Rao <jrao@redhat.com> Made-with: Cursor Signed-off-by: Jaideep Rao <jrao@redhat.com>

jaideepr97 requested review from ashwinb, bbrowning, cdoern, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners March 25, 2026 21:39

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 25, 2026

grs approved these changes Mar 26, 2026

View reviewed changes

jaideepr97 mentioned this pull request Mar 27, 2026

fix: prevent conversation history corruption when multiple tool calls require approval #5294

Closed

gyliu513 reviewed Mar 27, 2026

View reviewed changes

jaideepr97 added 3 commits March 27, 2026 11:54

jaideepr97 force-pushed the fix/approval-pop-history-corruption branch from 68c8651 to 7ac4102 Compare March 27, 2026 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: replace blunt pop with assistant message rewriting in _separate_tool_calls#5303

fix: replace blunt pop with assistant message rewriting in _separate_tool_calls#5303
jaideepr97 wants to merge 3 commits intollamastack:mainfrom
jaideepr97:fix/approval-pop-history-corruption

jaideepr97 commented Mar 25, 2026

Uh oh!

jaideepr97 commented Mar 25, 2026

Uh oh!

gyliu513 Mar 27, 2026

Uh oh!

jaideepr97 Mar 27, 2026

Uh oh!

gyliu513 commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -0,0 +1,348 @@
		# Copyright (c) Meta Platforms, Inc. and affiliates.

Conversation

jaideepr97 commented Mar 25, 2026

Summary

Fix

Test plan

Uh oh!

jaideepr97 commented Mar 25, 2026

Uh oh!

gyliu513 Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

jaideepr97 Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

gyliu513 commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants