MIGC
MIGC copied to clipboard
A failure case occurred when MIGC generate "2 cats and 3 dogs".
When I used MIGC to generate "2 cats and 3 dogs", I found that the first dog below would still look like a "cat". Is there any way to improve this result?
I am using realisticVisionV51_v51VAE.safetensors, here are my parameters:
rompt_final = [['4k, best quality, masterpiece, ultra high res, ultra detailed,a cat,a cat,a dog,a dog,a dog,grass', 'a cat', 'a cat', 'a dog', 'a dog', 'a dog', 'grass']] bboxes = [[[0.078125, 0.09375, 0.390625, 0.359375], [0.515625, 0.09375, 0.859375, 0.359375], [0.078125, 0.515625, 0.34375, 0.90625], [0.421875, 0.515625, 0.671875, 0.921875], [0.71875, 0.484375, 0.953125, 0.921875], [0.015625, 0.015625, 0.984375, 0.96875]]] negative_prompt = 'worst quality, low quality, watermark, text, blurry' seed = 12573842233801288171 seed_everything(seed) image = pipe(prompt_final, bboxes, num_inference_steps=50, guidance_scale=8, MIGCsteps=25, NaiveFuserSteps=25, aug_phase_with_and=False, negative_prompt=negative_prompt).images[0]
And here are the images generated by MIGC: