[Issue]: sglang bench shows long TTFT randomly in eager mode #105

@coderfeli

Problem Description

sglang bench shows long TTFT (time to first token) at random. This is related to Triton kernel re-JIT.
When run without CUDA graphs (eager mode), Triton JIT compilation is sometimes triggered again. Avoiding it currently requires setting a very large Triton cache size. This needs a fix.
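
To illustrate why a too-small kernel cache could produce random recompiles, here is a toy model only. It is not Triton's or sglang's actual cache: `BoundedKernelCache`, `count_compiles`, the LRU eviction policy, and the batch-size keys are all assumptions made up for this sketch. It counts how many simulated compilations occur when more distinct kernel specializations are in use than the cache can hold.

```python
from collections import OrderedDict


class BoundedKernelCache:
    """Toy LRU cache standing in for a size-limited compiled-kernel cache.

    Hypothetical: this does not reproduce Triton's real caching logic.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()
        self.compiles = 0  # number of simulated JIT compilations

    def get(self, key):
        if key in self.entries:
            self.entries.move_to_end(key)  # mark as most recently used
            return self.entries[key]
        self.compiles += 1  # cache miss: simulate a JIT compile
        self.entries[key] = f"binary-for-{key}"
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used entry
        return self.entries[key]


def count_compiles(capacity, keys, rounds):
    """Replay the same sequence of specialization keys and count compiles."""
    cache = BoundedKernelCache(capacity)
    for _ in range(rounds):
        for key in keys:
            cache.get(key)
    return cache.compiles


# Assumed specialization keys, e.g. distinct batch sizes seen in eager mode.
keys = [1, 2, 4, 8, 16, 32]
small = count_compiles(capacity=4, keys=keys, rounds=3)
large = count_compiles(capacity=64, keys=keys, rounds=3)
print(small, large)  # prints "18 6"
```

In this model the small cache recompiles on every call, while the large cache compiles each specialization once. That matches the observation that a very large cache size hides the problem, but it says nothing about the root cause in the real code path.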

Operating System

linux

CPU

all

GPU

all

ROCm Version

all

ROCm Component

No response

Steps to Reproduce

No response

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

No response

Additional Information

No response
