[Question] With dp=8 and max_num_seq=160, why does a run with aisbench fail to reach the expected concurrency of 160 (per dp), while sending requests with curl does reach 160? #181

@chopper0126

Description

Question description

ais_bench --models vllm_api_stream_chat --datasets synthetic_gen --mode perf
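
For reference, the arithmetic behind the expectation in the title can be sketched as follows. This is a minimal sketch, assuming requests are spread evenly across the dp ranks (an assumption, not something the issue confirms); the helper name `total_target_concurrency` is hypothetical and is not part of aisbench or vLLM.

```python
# Hypothetical helper, not part of aisbench or vLLM.
# Assumption: requests are distributed evenly across all dp ranks.
def total_target_concurrency(dp_size: int, max_num_seq_per_rank: int) -> int:
    """Return the total number of in-flight requests needed to fill every dp rank."""
    return dp_size * max_num_seq_per_rank


dp = 8             # value from the issue title
max_num_seq = 160  # value from the issue title
total = total_target_concurrency(dp, max_num_seq)
print(total)  # 1280
```

Under that assumption, the client would need to keep about 1280 requests in flight at the same time for each dp rank to see 160 concurrent sequences.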

Pre-submission checks

  • I have read the Quick Start in the homepage documentation, and it does not answer my question
