Bagheera
so i can actually reproduce this one on CUDA, **but** it only happens during training without autocast 🤔 the weights are in bf16 precision, not fp32. maybe this is what...
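for anyone trying to reproduce, the two setups i mean are roughly this (illustrative sketch, not the actual training code):

```py
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# setup A: weights cast to bf16 up front, no autocast (where the issue shows up)
model_a = torch.nn.Linear(16, 16).to(device, torch.bfloat16)
out_a = model_a(torch.randn(4, 16, device=device, dtype=torch.bfloat16))

# setup B: fp32 weights under autocast (where it doesn't);
# autocast downcasts matmuls but keeps reductions/normalizations in fp32
model_b = torch.nn.Linear(16, 16).to(device)
with torch.autocast(device_type=device, dtype=torch.bfloat16):
    out_b = model_b(torch.randn(4, 16, device=device))
```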
i'm constantly updating my models on the hub and this behaviour means that the latest model is always loaded for inference. disabling the network for the user when they have...
the base model was trained using it, so i figured aligning with the base model's training and inference would give better results. from my own tests, i can now reduce the...
see the base SDXL pipeline:

```py
# get unconditional embeddings for classifier free guidance
zero_out_negative_prompt = negative_prompt is None and self.config.force_zeros_for_empty_prompt
if do_classifier_free_guidance and negative_prompt_embeds is None and zero_out_negative_prompt:
    negative_prompt_embeds = torch.zeros_like(prompt_embeds)
    negative_pooled_prompt_embeds = torch.zeros_like(pooled_prompt_embeds)
```
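fwiw, here's a rough sketch of what aligning training-time caption dropout with that zeroing behaviour could look like (the helper name and dropout rate are made up, not from the actual script):

```py
import torch

def drop_caption_embeds(prompt_embeds, pooled_prompt_embeds, dropout_rate=0.1):
    # hypothetical helper: with probability `dropout_rate`, zero a sample's text
    # embeddings, mirroring what the pipeline produces for an empty prompt when
    # `force_zeros_for_empty_prompt` is enabled
    mask = torch.rand(prompt_embeds.shape[0], device=prompt_embeds.device) < dropout_rate
    prompt_embeds[mask] = 0.0
    pooled_prompt_embeds[mask] = 0.0
    return prompt_embeds, pooled_prompt_embeds
```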
can i also open a pull request for the other training examples, to add general dropout capabilities to them?
like the ticket for updating the fp16 error? #6231
a good demonstration of the two primary forms of residual noise in the current generation of models would probably be a good idea, though i can't think of how to integrate that...
wow, i hadn't expected all of those, or really looked into them until now. i'm not sure why this simple check made those tests fail. i am wondering if there's...
i was working on this support, but after seeing the results of the model i'm not sure it's ready to be added yet: there's a lot of residual noise...
@JincanDeng how are you doing caption dropout? zeroed embeddings, or a `""` prompt encoded by both TEs?
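for reference, the two options look something like this (the encoder stand-in is just illustrative, not the real TEs):

```py
import torch

def encode_with_both_tes(prompt):
    # dummy stand-in for the two SDXL text encoders: returns
    # (sequence embeds, pooled embeds) with plausible shapes
    return torch.randn(1, 77, 2048), torch.randn(1, 1280)

prompt_embeds, pooled = encode_with_both_tes("a photo of a cat")

# option 1: replace the caption's embeddings with zeros
zero_embeds = torch.zeros_like(prompt_embeds)
zero_pooled = torch.zeros_like(pooled)

# option 2: encode the empty string "" through both text encoders
empty_embeds, empty_pooled = encode_with_both_tes("")
# note: these are NOT zeros; the encoders produce nonzero features for ""
```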