Improve <think>/<thinking> tag identification in streaming responses #40

mludvig · 2025-12-10T01:23:35Z

Some models like Amazon Nova send their thoughts wrapped in .. or .. tags. It is not guaranteed that these strings arrive as single streaming events, in fact more often than not they don't and we may receive '<', 'thinking', '>' tokens separately.

This patch introduces a state machine that deals with such a fragmented stream, identifies the tags and properly emits the thoughts content as ContentTypes.THINK for a correct frontend rendering. Fixes the output from at least Amazon Nova and quite likely from other models that emit these tags too.

Copilot

Pull request overview

This PR replaces the previous batch-processing approach to detecting <think> and <thinking> tags with a streaming state machine that handles tags split across multiple chunks. This is essential for models like Amazon Nova that emit these tags in fragments rather than complete units.

Key changes:

Implements a 4-state machine ('normal', 'buffering_open', 'thinking', 'buffering_close') to detect and strip thinking tags from streaming content
Removes the old parseThinkingContent function that required complete strings
Adds state tracking fields (thinkingState, tagBuffer) to AgentContext for maintaining parser state across chunks

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
src/stream.ts	Removed old batch `parseThinkingContent` function and replaced conditional logic with streaming state machine that processes content character-by-character while buffering potential tag sequences
src/specs/fragmented-thinking.test.ts	Added new test file to verify the state machine correctly handles thinking tags split by whitespace boundaries using the fake streaming model
src/agents/AgentContext.ts	Added `thinkingState` and `tagBuffer` fields to maintain state machine context across streaming chunks

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/specs/fragmented-thinking.test.ts

src/stream.ts

src/specs/fragmented-thinking.test.ts

src/agents/AgentContext.ts

Add state machine to detect <thinking> and <think> tags that arrive split across multiple streaming chunks. This fixes Amazon Nova and similar models that send tokens like '<thinking', '>', content, '</', 'thinking', '>' as separate stream events. The state machine buffers content starting with '<' and waits for enough characters to determine if it's a thinking tag before routing to TEXT or THINK content types.

Verifies that <thinking> tags are correctly detected and stripped from streamed content, with thinking content routed to THINK type and regular text routed to TEXT type.

The state machine now strips <think>/<thinking> tags from content, so update the test to expect capture group 1 (content inside tags) instead of capture group 0 (entire match including tags).

Ensures partial tags aren't lost when stream ends mid-buffer

mludvig · 2025-12-10T03:12:05Z

@danny-avila copilot comments resolved

Copilot AI review requested due to automatic review settings December 10, 2025 01:23

Copilot started reviewing on behalf of mludvig December 10, 2025 01:24 View session

Copilot AI reviewed Dec 10, 2025

View reviewed changes

mludvig added 6 commits December 10, 2025 16:08

test: Add test for fragmented thinking tag parsing

e52f3c5

Verifies that <thinking> tags are correctly detected and stripped from streamed content, with thinking content routed to THINK type and regular text routed to TEXT type.

test: Update reasoning test to expect stripped tags

5c5757f

The state machine now strips <think>/<thinking> tags from content, so update the test to expect capture group 1 (content inside tags) instead of capture group 0 (entire match including tags).

fix: Flush thinking buffer at stream end in ModelEndHandler

921feba

Ensures partial tags aren't lost when stream ends mid-buffer

test: Add more test cases for thinking tag handling

d1f0d4d

doc: Describe state machine states

0a493ef

mludvig force-pushed the feat/nova-thinking branch from a9abb21 to 0a493ef Compare December 10, 2025 03:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve <think>/<thinking> tag identification in streaming responses #40

Improve <think>/<thinking> tag identification in streaming responses #40

Uh oh!

mludvig commented Dec 10, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mludvig commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Improve <think>/<thinking> tag identification in streaming responses #40

Are you sure you want to change the base?

Improve <think>/<thinking> tag identification in streaming responses #40

Uh oh!

Conversation

mludvig commented Dec 10, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mludvig commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant