rope cache for flux and wan #3

TmacAaron · 2025-11-02T15:01:28Z

What does this PR do?

The original implementation of RoPE in Flux and Wan will calculate the rotary_embeddings in every steps. But actually the rotary_embeddings for each step is the same, so we only need to do the RoPE operation once for each inference.

This PR will save the cache once the rotary_embeddings is calculated, and use the cached rotary_embeddings in following steps. And Finally when inference finished, the cache will be released.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

rope cache

01a9b23

TmacAaron changed the title ~~rope cache~~ rope cache for flux and wan Dec 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

rope cache for flux and wan #3

rope cache for flux and wan #3

Uh oh!

TmacAaron commented Nov 2, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rope cache for flux and wan #3

Are you sure you want to change the base?

rope cache for flux and wan #3

Uh oh!

Conversation

TmacAaron commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

TmacAaron commented Nov 2, 2025 •

edited

Loading