[RFC] Agent Categorization + ReACT Agent OutputParser by yanxi0830 · Pull Request #955 · llamastack/llama-stack

yanxi0830 · 2025-02-04T20:00:01Z

Problem

We want to standardize the steps necessary for users to build a ReACT agent with the ability to interleave between generating thoughts and taking task specific actions dynamically.

The current agent orchestration loop requires ad hoc logic for intercepting agent outputs and parsing outputs from output messages to fit a ReACT framework (example). This proposes changes to LlamaStack client SDKs and server APIs for better ergonomics to build an ReACT agent.

Proposed Solution

We want to have the flexibility to configure custom prompts and custom output parsers in agent loop execution.

Introduce the notion of OutputParser for parsing outputs from ReACT prompting into ToolCall agent output.
Our current agent loop with custom tool calls will loop and call tools until there’s no more tool response. In ReACT framework, action output typically maps to a tool call. We can re-utilize the agent loop, but add a parsing logic right after agent outputs to populate “action” into ToolCall to enable ReACT.

Client Agent SDK [RFC] Client Agent SDK OutputParser llama-stack-client-python#121
We need to incorporate similar output parsing on server for ReACT with builtin tools.

For further generalization: RFC for high-level concept categorization of what defines an Agent Type.

Current Agent Types Summary

An Agent instance is defined by an AgentConfig
An Agent instance can be categorized into several classes
- Vanilla
  - keep track of conversation loop history
- RAG
  - access to "builtin::rag" tool
  - we current first force retrieve context by explicitly calling memory tool
- ToolCalling
  - could be configured to use "builtin::websearch" / "builtin::code_interpreter" / "builtin::wolfram_alpha" / "builtin::filesystem" / etc
- ReACT
  - require custom output parser to execute "action" as tool calls

Agent Type	Agent Config Template	System Prompt	Output Parser	Orchestration	Note
Vanilla	raw Agent	instruction		Conversation Loop
Tool Calling	toolgroups=[“builtin::websearch”, “builtin::code_interpreter”]	default tool prompt + instruction	decode_assistant_message_from_content	Loop until there’s no more tool calls Pass tool response as next turn (built-in tool & custom tool differ)
RAG	toolgroups=(builtin::rag, args: {vector_db_ids}) force_retrieval=?	default tool prompt + instruction		Retrieve context from RAG tool before calling generation.	We should add an ability to force retrieval & ability for auto retrieval via model tool calling
ReACT	instructions=react_prompt output_parser=react_output_parser	ReACT prompting (thought-action-answer)	Parse from action / action_input into ToolCall as part of Agent Response.	Loop until there’s no more tool calls Pass tool response as next turn

Proof of Concept Implementation

llama-stack-client-python: [RFC] Client Agent SDK OutputParser llama-stack-client-python#121
llama-stack-apps: feat: ReACT agent example llama-stack-apps#166
llama-stack: this PR
llama-models: [RFC] response output type meta-llama/llama-models#272

yanxi0830 · 2025-02-04T23:06:38Z

llama_stack/apis/inference/inference.py

    response_format: Optional[ResponseFormat] = None
    stream: Optional[bool] = False
    logprobs: Optional[LogProbConfig] = None
+    response_output_parser: Optional[ResponseOutputParser] = Field(default=ResponseOutputParser.default)


Synced offline w/ @ashwinb @hardikjshah , we will hold off this and put parsers on client SDK side.

hardikjshah · 2025-02-04T23:10:57Z

Summarizing what we discussed offline --

No need for a ResponseOutputParser as this can be done client side ( for the time being )
Lets go away from the custom format to structured outputs

Thought: I need to transform the image that I received in the previous observation to make it green.
Action:
{
  "action": "image_transformer",
  "action_input": {"image": "image_1.jpg"}
}<end_action>

class Response: 
    thought: str
    tool_name: Optional[str]
    tool_params: Optional[str]
    answer: Optional[str]

[ not sure if this will work, i think you ll have to test on this format and see what works , for eg. does optional work consistently ]

Encapsulate all ReACTAgent logic in one class so that end users using it can do so with 1-2 lines of code.

yanxi0830 · 2025-02-05T22:17:30Z

Closing PR, moving to https://github.com/meta-llama/llama-stack/discussions/975

yanxi0830 added 4 commits February 4, 2025 10:27

tmp

5595f5b

revert print

b1492ec

tmp

4d07460

api change

662f171

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 4, 2025

This was referenced Feb 4, 2025

[RFC] Client Agent SDK OutputParser llamastack/llama-stack-client-python#121

Merged

feat: ReACT agent example llamastack/llama-stack-apps#166

Merged

yanxi0830 marked this pull request as ready for review February 4, 2025 20:03

yanxi0830 requested review from ashwinb, dineshyv, dltn, ehhuang, hardikjshah, raghotham, sixianyi0721 and vladimirivic as code owners February 4, 2025 20:03

yanxi0830 mentioned this pull request Feb 4, 2025

[RFC] response output type meta-llama/llama-models#272

Closed

yanxi0830 commented Feb 4, 2025

View reviewed changes

remove response parser

42f0e91

yanxi0830 closed this Feb 5, 2025

yanxi0830 deleted the react_agent branch March 15, 2025 20:38

raghotham mentioned this pull request Jan 13, 2026

Discussion #975: [RFC] Agent Categorization + ReACT Agent OutputParser #4582

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Agent Categorization + ReACT Agent OutputParser#955

[RFC] Agent Categorization + ReACT Agent OutputParser#955
yanxi0830 wants to merge 5 commits intomainfrom
react_agent

yanxi0830 commented Feb 4, 2025 •

edited

Loading

Uh oh!

yanxi0830 Feb 4, 2025

Uh oh!

hardikjshah commented Feb 4, 2025

Uh oh!

yanxi0830 commented Feb 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yanxi0830 commented Feb 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Proposed Solution

Current Agent Types Summary

Proof of Concept Implementation

Uh oh!

yanxi0830 Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

hardikjshah commented Feb 4, 2025

Uh oh!

yanxi0830 commented Feb 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yanxi0830 commented Feb 4, 2025 •

edited

Loading