My RTF is always around 0.12 with TRT-LLM docker on L20, never match the claimed benchmark result

### Checks

- [x] This template is only for usage issues encountered.
- [x] I have thoroughly reviewed the project documentation but couldn't find information to solve my problem.
- [x] I have searched for existing issues, including closed ones, and couldn't find a solution.
- [x] I am using English to submit this issue to facilitate community communication.

### Environment Details

nvidia L20, soar97/triton-f5-tts:24.12, just as https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/runtime/triton_trtllm/README.md

But my RTF is always around 0.12

```
⚡ Real-Time Factor:
   Mean: 0.120x
   Median: 0.112x
   Min/Max: 0.097x / 0.152x
```

My result is much slower than the claimed benchmark result.

What did I do wrongly ?

### Steps to Reproduce

1. find a clean L20 machine, clone the project
2. start the triton docker `MODEL=F5TTS_v1_Base docker compose up` as https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/runtime/triton_trtllm/README.md said
3. test the RTF


### ✔️ Expected Behavior

_No response_

### ❌ Actual Behavior

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

My RTF is always around 0.12 with TRT-LLM docker on L20, never match the claimed benchmark result #1214

Checks

Environment Details

Steps to Reproduce

✔️ Expected Behavior

❌ Actual Behavior

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

My RTF is always around 0.12 with TRT-LLM docker on L20, never match the claimed benchmark result #1214

Description

Checks

Environment Details

Steps to Reproduce

✔️ Expected Behavior

❌ Actual Behavior

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions