DreamBooth LoRA training failure

Open yayueLaurazhou opened this issue 7 months ago • 2 comments

I am using the image dataset from the DreamBooth paper, available here: https://github.com/google/dreambooth/tree/main/dataset/can

The model is Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers, and this is the command I run:

accelerate launch /mnt/sdc/zhouyayue/projects/Sana/train_scripts/train_dreambooth_lora_sana.py \
  --pretrained_model_name_or_path="Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers" \
  --instance_data_dir="/mnt/sdc/zhouyayue/projects/Sana/dreambooth_data" \
  --output_dir="/mnt/sdc/zhouyayue/projects/Sana/output_dreambooth_lora" \
  --mixed_precision="bf16" \
  --instance_prompt="a photo of a <edfe> can" \
  --resolution=1024 \
  --train_batch_size=1 \
  --gradient_accumulation_steps=4 \
  --use_8bit_adam \
  --learning_rate=1e-6 \
  --report_to="wandb" \
  --lr_scheduler="constant" \
  --lr_warmup_steps=0 \
  --max_train_steps=800 \
  --validation_prompt="ground level view, low camera angle, a realistic photo of a <edfe> can on terrazo tile floor" \
  --validation_epochs=50 \
  --seed="0" \
  --push_to_hub 
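
For reference, here is a rough sketch of how the step and epoch counts relate for the flags above. It assumes the script follows the usual diffusers DreamBooth convention of capping the run at `--max_train_steps` optimizer updates and deriving the epoch count from that; this has not been checked against `train_dreambooth_lora_sana.py` itself. The function name `training_schedule` and the dataset size `num_images=5` are placeholders for illustration, not values from this thread.

```python
import math


def training_schedule(num_images, train_batch_size,
                      gradient_accumulation_steps, max_train_steps):
    """Return (optimizer updates per epoch, number of epochs) for a step-capped run.

    Hypothetical helper: mirrors the common diffusers-style arithmetic,
    not code taken from the Sana training script.
    """
    # One epoch is one pass over the instance images.
    batches_per_epoch = math.ceil(num_images / train_batch_size)
    # Gradients are accumulated over several batches before each optimizer update.
    updates_per_epoch = math.ceil(batches_per_epoch / gradient_accumulation_steps)
    # The run stops after max_train_steps updates, which fixes the epoch count.
    num_epochs = math.ceil(max_train_steps / updates_per_epoch)
    return updates_per_epoch, num_epochs


# num_images=5 is an ASSUMED dataset size; the other values come from the command above.
updates_per_epoch, num_epochs = training_schedule(
    num_images=5,
    train_batch_size=1,
    gradient_accumulation_steps=4,
    max_train_steps=800,
)
print(f"updates per epoch: {updates_per_epoch}, epochs: {num_epochs}")
```

Under these assumptions, an epoch corresponds to only one or two optimizer updates, so `--validation_epochs=50` fires validation far more often in wall-clock terms than the word "epoch" might suggest. Substitute the real image count in `--instance_data_dir` to get the actual figures.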

I changed the learning rate from 1e-4 to 1e-6. The result is below:

Image

At 100 epochs, these are the validation images:

Image

Image

Image

After 700 epochs, it gives me photos that don't even look like a can, let alone show any subject-specific features:

Image

Image

What is wrong with my training?

yayueLaurazhou • Apr 06 '25 14:04

It's the model, not your training.

bghira • Apr 15 '25 20:04

Can you tell me how we can test our diffusers Sana DreamBooth LoRA checkpoints? Do you have a workflow for this?

nanaj96 • Apr 17 '25 16:04