sd-scripts issues

[Feature Request] Decouple `linear1` and `linear2` Flux layers in network_args

5

Hi, There's a popular discussion thread that suggests training the `proj_out` (`linear2)` module of single blocks 7 and 20 for Flux LoRAs: https://old.reddit.com/r/StableDiffusion/comments/1f523bd/good_flux_loras_can_be_less_than_45mb_128_dim/ As far as I can tell, it...

EricBCoding

Strange behavior with --network_args parameter

Adding --network_args to Additional Parameters, even without arguments, sets certain settings to their default or unknown values. It's not clear what these settings are. For example, with --network_args and --cache_text_encoder_outputs...

codecomputation

Flux merge tool does not properly handle LoRA keys from AI-Toolkit when merging w/ Flux.dev model - All keys "Unused" / Unchanged model file outputted

9

Hello, Currently the featuring to merge Flux LoRA into the base model does not function properly with LoRA trained by Ostris' AI-Toolkit. This appears to be due to a difference...

CCpt5

Flux fine-tune text encoder rate is too high; need to be able to set separately

2

Hello! I'm using flux_train.py from the sd3 branch to fine-tune Flux on a custom data set. It's working, but I'm finding that the text encoders are getting over-trained very quickly....

BelieveDiffusion

Update submodules as well since MPS changes were needed

WORK IN PROGRESS - For POC Part of changes for https://github.com/bmaltais/kohya_ss/pull/2830 I'm not sure on these changes at all... Most (if not all) I didn't make. I think this is...

JoeyOverby

(sd3 branch Flux LoRA Training) RuntimeError: "index_select_cuda" not implemented for 'Float8_e4m3fn'

2

In issue #1453 I made a PR #1452 to fix AttributeError: 'T5EncoderModel' object has no attribute 'text_model', while loading T5 into GPU without 'cache_text_encoder_outputs'. But I did't check `T5EncoderModel` dict.😅...

fireicewolf

[Feature request] --caption_dropout_rate when cache te output

[flux-style-captioning-differences-training-diary](https://civitai.com/articles/6792/flux-style-captioning-differences-training-diary) [flux-style-captioning-differences-pt2-4-new-caption-tools-training-diary](https://civitai.com/articles/7146/flux-style-captioning-differences-pt2-4-new-caption-tools-training-diary) [flux-is-smarter-than-you-and-other-surprising-findings-on-making-the-model-your-own](https://civitai.com/articles/6982/flux-is-smarter-than-you-and-other-surprising-findings-on-making-the-model-your-own) Some FLUX training experience comparisons point out that FLUX performs well even when trained without captions. However, when testing with entirely captionless data, it becomes difficult to...

gesen2egee

enhancement

poolFD failed error when caching latents for flux finetuning

4

@kohya-ss When I fine-tune Flux with 18,000 images, after caching the latents, the following error occurs. What could be the problem?Is this a bug, or is it because the data...

huxian0402

error when use deepspeed for FLUX.1 fine-tuning

@kohya-ss @lansing @rockerBOO @akx @tsukimiya With the following configuration, multi-GPU training works properly, and the results are normal. Does sd-scripts not support DeepSpeed acceleration? Could you help me check it?...

huxian0402

Collecting image sizes is very slow.

7

When I train with 4000 images, it takes forever to start training because just collecting the image sizes from the npz filenames takes 20 minutes from the drive where I...

markrmiller

sd-scripts
sd-scripts copied to clipboard

Metadata

[Feature Request] Decouple `linear1` and `linear2` Flux layers in network_args

Strange behavior with --network_args parameter

Flux merge tool does not properly handle LoRA keys from AI-Toolkit when merging w/ Flux.dev model - All keys "Unused" / Unchanged model file outputted

Flux fine-tune text encoder rate is too high; need to be able to set separately

Update submodules as well since MPS changes were needed

(sd3 branch Flux LoRA Training) RuntimeError: "index_select_cuda" not implemented for 'Float8_e4m3fn'

[Feature request] --caption_dropout_rate when cache te output

poolFD failed error when caching latents for flux finetuning

error when use deepspeed for FLUX.1 fine-tuning

Collecting image sizes is very slow.

← Metadata

Owner

Metadata

sd-scripts sd-scripts copied to clipboard

Metadata

← Metadata

Owner

Metadata

sd-scripts
sd-scripts copied to clipboard