HelloWorldBeginner
I found some errors in the checks. How can I fix them? ``` examples/controlnet/train_controlnet_sdxl.py:16:1: I001 [*] Import block is un-sorted or un-formatted examples/text_to_image/train_text_to_image_lora_sdxl.py:18:1: I001 [*] Import block is un-sorted or un-formatted...
I've already fixed the code formatting issues in the checks.
I've moved the NPU flash attention into its own module and added a parameter that switches it on or off. I've tested it and it works. @sayakpaul
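For illustration only, a parameter-controlled switch of this kind could look like the sketch below. The flag name `--use_npu_flash_attention` and the two attention functions are hypothetical placeholders, not the names used in the PR, and the real kernels are replaced by stubs so the example runs anywhere.

```python
import argparse


def default_attention(q, k, v):
    # Hypothetical stand-in for the framework's standard attention path.
    return "default"


def npu_flash_attention(q, k, v):
    # Hypothetical stand-in for an NPU-specific fused attention kernel.
    return "npu_flash"


def build_parser():
    parser = argparse.ArgumentParser()
    # Hypothetical flag name; off by default so existing runs are unchanged.
    parser.add_argument(
        "--use_npu_flash_attention",
        action="store_true",
        help="Route attention through the NPU flash attention module.",
    )
    return parser


def select_attention(args, npu_available):
    # Pick the attention implementation based on the command-line switch.
    if args.use_npu_flash_attention:
        if not npu_available:
            raise ValueError("NPU flash attention requested but no NPU is available.")
        return npu_flash_attention
    return default_attention
```

Keeping the switch off by default means that users without an NPU never reach the device-specific code path.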
> Thanks for working on this. > > For me, the following would be nice to add before we merge: > > * Documentation -- add an entry about the...
I've updated the code. @sayakpaul
Hi @sayakpaul. I noticed the PR is still open. Does the code still need review from others?
Hi @haotian-liu, could you review my code?
I haven't used CPU offload in DeepSpeed, but ZeRO-2 works fine on 8xA100s.
> Thanks very much for your PR. Could we maybe split the PR into related sections please? > > 1. NPU related things > 2. DeepSpeed related things > >...
> Hi @HelloWorldBeginner , > > I have tried setting `num_workers=0`, but it seems that I have obtained the opposite conclusion that training with a higher value of `num_workers` (previously...