diffusers
diffusers copied to clipboard
ROPE computing suitable for NPU
What does this PR do?
In this way, the computing time cost for NPU will reduce about 3%-5%.
Before submitting
- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [x] Did you read the contributor guideline?
- [x] Did you read our philosophy doc (important for complex PRs)?
- [ ] Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
- [x] Did you write any new necessary tests?
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.
@sayakpaul Please take a look at this PR, thank you!
I will pause this PR for this moment, because it needs to some tests with FSDP2 in FLUX.2. Sorry for the inconvenience.
@sayakpaul @yiyixuxu The Flux.2 has been tested as well. The performance also increased. Thank you for your time :)