MIGC A failure case occurred when MIGC generate "2 cats and 3 dogs".

A failure case occurred when MIGC generate "2 cats and 3 dogs".

Open saikouMonika opened this issue 11 months ago • 3 comments

When I used MIGC to generate "2 cats and 3 dogs", I found that the first dog below would still look like a "cat". Is there any way to improve this result?

I am using realisticVisionV51_v51VAE.safetensors, here are my parameters:

rompt_final = [['4k, best quality, masterpiece, ultra high res, ultra detailed,a cat,a cat,a dog,a dog,a dog,grass', 'a cat', 'a cat', 'a dog', 'a dog', 'a dog', 'grass']] bboxes = [[[0.078125, 0.09375, 0.390625, 0.359375], [0.515625, 0.09375, 0.859375, 0.359375], [0.078125, 0.515625, 0.34375, 0.90625], [0.421875, 0.515625, 0.671875, 0.921875], [0.71875, 0.484375, 0.953125, 0.921875], [0.015625, 0.015625, 0.984375, 0.96875]]] negative_prompt = 'worst quality, low quality, watermark, text, blurry' seed = 12573842233801288171 seed_everything(seed) image = pipe(prompt_final, bboxes, num_inference_steps=50, guidance_scale=8, MIGCsteps=25, NaiveFuserSteps=25, aug_phase_with_and=False, negative_prompt=negative_prompt).images[0]

And here are the images generated by MIGC:

be2a7472c1fa3ffcb38012ef9edc00c

89818c8daca02bf7ea39fefe4ce2c3b

Mar 16 '24 09:03 saikouMonika

MIGC MIGC copied to clipboard

A failure case occurred when MIGC generate "2 cats and 3 dogs".

MIGC
MIGC copied to clipboard