Stepfunction
This is frankly bizarre to me. In `transformer_lumina2.py`, in the definition of `Lumina2Transformer2DModel`, the `forward()` function clearly has `encoder_attention_mask` in it:

```
def forward(
    self,
    hidden_states: torch.Tensor,
    timestep: torch.Tensor,
    encoder_hidden_states: ...
```
Solved it! That was a weird issue with diffusers. The `Lumina2TransformerBlock` in `transformer_lumina2.py` has a different signature than the `forward()` call in `Lumina2Transformer2DModel` expects. It's possible to get...
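For anyone hitting the same thing, here is a minimal, self-contained sketch of how this kind of mismatch surfaces. This is not the actual diffusers code (`Block` and `Model` are made-up stand-ins): the outer `forward()` advertises a keyword argument, but the inner block's `forward()` was written without it, so forwarding the kwarg through raises an error at call time.

```
import torch
import torch.nn as nn


class Block(nn.Module):
    # Inner block: its forward() does NOT accept encoder_attention_mask.
    def forward(self, hidden_states, encoder_hidden_states):
        return hidden_states + encoder_hidden_states.mean()


class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.block = Block()

    # Outer forward() clearly has encoder_attention_mask in its signature,
    # but passing it down to the block fails because the block's own
    # forward() has a different signature.
    def forward(self, hidden_states, encoder_hidden_states, encoder_attention_mask=None):
        return self.block(
            hidden_states,
            encoder_hidden_states,
            encoder_attention_mask=encoder_attention_mask,  # fails here
        )


model = Model()
x = torch.zeros(1, 4)
ctx = torch.ones(1, 4)
try:
    model(x, ctx, encoder_attention_mask=torch.ones(1, 4))
except TypeError as e:
    # forward() got an unexpected keyword argument 'encoder_attention_mask'
    print(e)
```

So the outer signature having the argument tells you nothing about whether the block it delegates to can actually receive it.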
I can confirm that I am experiencing the same thing on my end.
I can confirm that I am getting the same AttributeError as @jpXerxes after cloning the latest sd3 branch. I was able to bypass the issue and begin training by adding `--cache_text_encoder_outputs` to...
You can also remove the sd scripts directory and replace it with the latest version of the sd3 branch.
With a 24GB card, I run out of VRAM after about 30 or so training steps.
I understand the potential of the dataset configuration file, but it's a little redundant if you want the same dataset at three different resolutions. It could definitely be constructed automatically...
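To illustrate, here is a rough sketch of what "constructed automatically" could look like: a few lines of Python that emit a dataset config reusing one image directory at several resolutions. The `[[datasets]]`/`[[datasets.subsets]]` layout follows the sd-scripts dataset config format as I understand it; `IMAGE_DIR` and the batch/repeat values are placeholders.

```
# Hypothetical helper: generate a dataset_config.toml that points the
# same image directory at multiple training resolutions.
IMAGE_DIR = "/path/to/images"  # placeholder
RESOLUTIONS = [512, 768, 1024]


def make_config(image_dir: str, resolutions: list[int]) -> str:
    parts = ["[general]", 'caption_extension = ".txt"', ""]
    for res in resolutions:
        parts += [
            "[[datasets]]",
            f"resolution = {res}",
            "batch_size = 1",
            "",
            "  [[datasets.subsets]]",
            f'  image_dir = "{image_dir}"',
            "  num_repeats = 1",
            "",
        ]
    return "\n".join(parts)


with open("dataset_config.toml", "w") as f:
    f.write(make_config(IMAGE_DIR, RESOLUTIONS))
```

One directory, three resolutions, and the redundancy lives in a script instead of a hand-maintained file.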
What exactly are split mode and train blocks?
Very much appreciate the response. Thank you!
My initial attempt with an LR of 1e-5 overtrained rapidly. A second attempt with an LR of 2e-6 seems to be more stable so far.