Skip to content

My RTF is always around 0.12 with TRT-LLM docker on L20, never match the claimed benchmark result #1214

@qiulang

Description

@qiulang

Checks

  • This template is only for usage issues encountered.
  • I have thoroughly reviewed the project documentation but couldn't find information to solve my problem.
  • I have searched for existing issues, including closed ones, and couldn't find a solution.
  • I am using English to submit this issue to facilitate community communication.

Environment Details

nvidia L20, soar97/triton-f5-tts:24.12, just as https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/runtime/triton_trtllm/README.md

But my RTF is always around 0.12

⚡ Real-Time Factor:
   Mean: 0.120x
   Median: 0.112x
   Min/Max: 0.097x / 0.152x

My result is much slower than the claimed benchmark result.

What did I do wrongly ?

Steps to Reproduce

  1. find a clean L20 machine, clone the project
  2. start the triton docker MODEL=F5TTS_v1_Base docker compose up as https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/runtime/triton_trtllm/README.md said
  3. test the RTF

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentationenhancementNew feature or requesthelp wantedExtra attention is needed

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions