fast-stable-diffusion icon indicating copy to clipboard operation
fast-stable-diffusion copied to clipboard

Intermediate Checkpoints are not being saved

Open mandal-rahul opened this issue 1 year ago • 12 comments

Im running for 4000steps with saving model every 500steps starting from 1500. But still it doesn’t save the intermediate steps and just at the end of 4000steps it dumps the final checkpoint in gdrive

mandal-rahul avatar Oct 26 '22 20:10 mandal-rahul

Did you check the box "Save_Checkpoint_Every_n_Steps" ?

TheLastBen avatar Oct 26 '22 20:10 TheLastBen

Yes I did, still it wont save the ckpt files. Also im using the old method

mandal-rahul avatar Oct 27 '22 08:10 mandal-rahul

I'll check it out

TheLastBen avatar Oct 27 '22 09:10 TheLastBen

Yes, I just tried the old method too, and it didn't save any intermediate checkpoints.

mauzus avatar Oct 28 '22 06:10 mauzus

Did you test it with the new method ?

sv

TheLastBen avatar Oct 28 '22 09:10 TheLastBen

The new method doest work perfectly, but the old method doesnt save intermediate checkpoints

mandal-rahul avatar Oct 28 '22 12:10 mandal-rahul

I'll fix the issue with the old method

I'll walk you through the new method, just tell me how many instance images you have and what subject you're training

TheLastBen avatar Oct 28 '22 12:10 TheLastBen

Thanks, seems like the old method is working fine now. I have 130 instance images, and the subject is man. In the new approach I have trained for 9k steps with saving ckpt every 500 starting from 1000. I did evaltuation of each ckpt (starting from 1000 steps till 9k steps) but for me the old approach gives me best result.

mandal-rahul avatar Oct 28 '22 14:10 mandal-rahul

Try this :

  • Pick the best 15 pictures from your dataset, rename all of them to "pbftefbdg", so you'll get : pbftefbdg (1) .... pbftefbdg (15),
  • Run the new method cells, and upload the 15 images, make the images contain closeups.
  • Set the steps to 2000 (should take 30-40 minutes), keep the box fp16 checked.

When done, try this exact prompt :

(pbftefbdg), award winning photo by Patrick Demarchelier , 20 megapixels, 32k definition, fashion photography, ultra detailed, precise, elegant

Negative prompt: ((((ugly)))), (((duplicate))), ((morbid)), ((mutilated)), [out of frame], extra fingers, mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((mutation))), (((deformed))), ((ugly)), blurry, ((bad anatomy)), (((bad proportions))), ((extra limbs)), cloned face, (((disfigured))), out of frame, ugly, extra limbs, (bad anatomy), gross proportions, (malformed limbs), ((missing arms)), ((missing legs)), (((extra arms))), (((extra legs))), mutated hands, (fused fingers), (too many fingers), (((long neck)))

Steps: 45, Sampler: DPM2 a Karras, CFG scale: 8.5, Seed: 2871323065, Size: 512x704, Model hash: ef85023d, Denoising strength: 0.7, First pass size: 0x0

Let me know the result

TheLastBen avatar Oct 28 '22 14:10 TheLastBen

sure thing, will give it a try tomorrow

mandal-rahul avatar Oct 28 '22 15:10 mandal-rahul

It's still broken, unfortunately. Sometimes it works, sometimes it doesn't for some reason... It's so painful to waste a lot of time and only get an overtrained model. 😭

I'll have to learn the new method and see if I have better luck. 😰

mauzus avatar Nov 01 '22 14:11 mauzus

The new method doesn't need to be learned, it's only 2 steps

TheLastBen avatar Nov 01 '22 14:11 TheLastBen