Skip to content

Does not use native support for function/tool calling #9

@zainhas

Description

@zainhas

The function calls do not pass in the tools using the tools and tool_choice params and thus we are testing tool calling using only model's normal text generation capability and not the native support for function/tool calling that a lot of models now have.

Image

For example the Berkeley Function-Calling Leaderboard enables both evaluating native support FC and prompt based.

It would be good to have a version of Acebench that implements FC's in the following manner:

tools = [
  {
    "type": "function",
    "function": {
      "name": "get_current_weather",
      # ...
    }
  },
  {
    "type": "function",
    "function": {
      "name": "get_current_stock_price",
      # ...
    }
  }
]

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-7B-Instruct-Turbo",
    messages=[
      {"role": "user", "content": "What's the current price of Apple's stock?"},
    ],
    tools=tools,
    tool_choice={"type": "function", "function": {"name": "get_current_stock_price"}}
)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions