feat: Add vLLM tool parsing support in completions #646

gitlost-murali · 2025-12-14T21:00:14Z

Description

This pull request adds support for parsing and extracting structured tool calls from model outputs in the generator, making it possible to handle tool-augmented chat completions. It introduces a configurable tool parser, updates the Completion data model to include tool call information, and adds comprehensive unit tests for both tool parsing and non-tool parsing scenarios.

What this does

When users send requests with tool parsing enabled like this:

formatted_request = tokenizer.apply_chat_template(
    as_chat,
    tools=tools,
    tokenize=False,
    add_generation_prompt=True,
)

response = await policy.generate.route(formatted_request)
completion = response[0]

The completion object will now have:

completion.tool_calls - List of parsed tool calls from the model output
completion.has_tool_calls - Boolean property to check if any tool calls exist
completion.content - The text content excluding tool call tags

To enable this, set tool_call_parser="hermes" when creating the policy.

Configuration

To enable tool parsing, add tool_call_parser to your policy configuration in the YAML file:

# Policy configuration
policy:
  engine_args:
    model: ${model}
    tensor_parallel_size: 2
  sampling_params:
    n: ${group_size}
    max_tokens: ${max_res_tokens}
  tool_call_parser: "hermes"  # Enable tool call parsing (optional)

Test Plan

Unit Tests

Added unit tests covering both scenarios: with and without tool parsing, verifying correct extraction and population of tool call information. (tests/unit_tests/test_generator.py)

Integration Tests (Real-World Example)

Added integration tests that validate the full tool-calling workflow with an actual model (Qwen/Qwen3-0.6B):

pytest tests/integration_tests/test_tool_parsing.py -v -s

Tests:

test_tool_parsing_multi_turn - End-to-end tool calling workflow:
- User asks "Calculate 123 + 456"
- Model generates tool call
- Tool call is extracted: calculator(equation="123 + 456") with hermes parser
- Calculator executes, result fed back to model
- Model returns final answer containing "579"
test_content_without_tool_calls - Verifies non-tool requests:
- User asks "What is the capital of France?"
- Confirms tool_calls == [] and content == text

daniellepintz · 2025-12-15T13:58:37Z

Hi @gitlost-murali! Thanks for the PR! I am wondering do you have a real world example you could test this on? (and add to Test Plan)

daniellepintz

Looks like unit tests are failing as well

gitlost-murali · 2025-12-17T20:15:11Z

Hi @daniellepintz , Thanks! I fixed the tests.

Specifically, I added an end-end integration test which reflects multi-turn chat usage. In unit tests, I use mocked responses to test the actual parsing functionality.

W.r.t test plan, I added integration tests with real world use case and mentioned how to run it along with the expected flow. Is that what is expected? Happy to change the test plan section

…te test accordingly.

…when tool parser is enabled

…zer stub and mocked responses

daniellepintz · 2026-01-04T23:04:38Z

Hi @gitlost-murali, thanks for the PR. Unfortunately, similar to the other PRs, we prefer to not make changes to generator.py until #669 is resolved :/ So let's check back in at that point

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Dec 14, 2025

daniellepintz requested changes Dec 15, 2025

View reviewed changes

gitlost-murali force-pushed the check-vllm branch from ca2799b to 883494e Compare December 17, 2025 20:08

gitlost-murali requested a review from daniellepintz December 17, 2025 20:15

gitlost-murali and others added 7 commits December 21, 2025 08:43

feat: add vllm tool parsing in completions

8ba18ed

refactor: remove exploratory notebook and unused ToolDefinition. Upda…

99b1fed

…te test accordingly.

test: Update test cases for less tokens and add new no-tool scenario …

5dbcb82

…when tool parser is enabled

test: avoid downloading model and tokenizer from internet. use tokeni…

811603e

…zer stub and mocked responses

fix: update tool parser retrieval to use keys from ToolParserManager

8a46cf3

chore: fix linting

1c37863

test: add integration tests for vLLM tool parsing workflow

2a6face

gitlost-murali force-pushed the check-vllm branch from 883494e to 2a6face Compare December 21, 2025 08:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add vLLM tool parsing support in completions #646

feat: Add vLLM tool parsing support in completions #646

gitlost-murali commented Dec 14, 2025 •

edited

Loading

Uh oh!

daniellepintz commented Dec 15, 2025

Uh oh!

daniellepintz left a comment

Uh oh!

gitlost-murali commented Dec 17, 2025

Uh oh!

daniellepintz commented Jan 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Add vLLM tool parsing support in completions #646

Are you sure you want to change the base?

feat: Add vLLM tool parsing support in completions #646

Conversation

gitlost-murali commented Dec 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

What this does

Configuration

Test Plan

Unit Tests

Integration Tests (Real-World Example)

Uh oh!

daniellepintz commented Dec 15, 2025

Uh oh!

daniellepintz left a comment

Choose a reason for hiding this comment

Uh oh!

gitlost-murali commented Dec 17, 2025

Uh oh!

daniellepintz commented Jan 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gitlost-murali commented Dec 14, 2025 •

edited

Loading