sd-scripts
Hi, there's a popular discussion thread that suggests training the `proj_out` (`linear2`) module of single blocks 7 and 20 for Flux LoRAs: https://old.reddit.com/r/StableDiffusion/comments/1f523bd/good_flux_loras_can_be_less_than_45mb_128_dim/ As far as I can tell, it...
Adding --network_args to Additional Parameters, even without arguments, sets certain settings to their default or unknown values. It's not clear what these settings are. For example, with --network_args and --cache_text_encoder_outputs...
Hello, currently the feature for merging a Flux LoRA into the base model does not work properly with LoRAs trained with Ostris' AI-Toolkit. This appears to be due to a difference...
Hello! I'm using flux_train.py from the sd3 branch to fine-tune Flux on a custom data set. It's working, but I'm finding that the text encoders are getting over-trained very quickly....
WORK IN PROGRESS - For POC Part of changes for https://github.com/bmaltais/kohya_ss/pull/2830 I'm not sure on these changes at all... Most (if not all) I didn't make. I think this is...
In issue #1453 I made a PR #1452 to fix AttributeError: 'T5EncoderModel' object has no attribute 'text_model', while loading T5 into GPU without 'cache_text_encoder_outputs'. But I didn't check the `T5EncoderModel` dict.😅...
[flux-style-captioning-differences-training-diary](https://civitai.com/articles/6792/flux-style-captioning-differences-training-diary) [flux-style-captioning-differences-pt2-4-new-caption-tools-training-diary](https://civitai.com/articles/7146/flux-style-captioning-differences-pt2-4-new-caption-tools-training-diary) [flux-is-smarter-than-you-and-other-surprising-findings-on-making-the-model-your-own](https://civitai.com/articles/6982/flux-is-smarter-than-you-and-other-surprising-findings-on-making-the-model-your-own) Some FLUX training experience comparisons point out that FLUX performs well even when trained without captions. However, when testing with entirely captionless data, it becomes difficult to...
@kohya-ss When I fine-tune Flux with 18,000 images, the following error occurs after caching the latents. What could be the problem? Is this a bug, or is it because the data...
@kohya-ss @lansing @rockerBOO @akx @tsukimiya With the following configuration, multi-GPU training works properly, and the results are normal. Does sd-scripts not support DeepSpeed acceleration? Could you help me check it?...
When I train with 4000 images, it takes forever to start training because just collecting the image sizes from the npz filenames takes 20 minutes from the drive where I...