ELLA icon indicating copy to clipboard operation
ELLA copied to clipboard

Ella can sometimes make already-correct results much less correct

Open Akira13641 opened this issue 1 year ago • 3 comments

RealCartoon 3D V15

Princess Peach is standing next to Tifa Lockhart, they are outside on a summer day, they are wearing bikinis. high quality, best quality, masterpiece

Without Ella: ComfyUI_14055_

With ELLA, same seed: ComfyUI_14056_

Using Ella in this case turns Princess Peach into a random pink-haired girl instead of the recognizable character.

Akira13641 avatar Apr 22 '24 23:04 Akira13641

During ELLA's training, a large number of synthetic captions were used, which typically do not include names or character names. Therefore, if your prompt contains a name, ELLA's performance is very poor. You can try replacing the name with 'a woman' and concatenate the output of CLIP.

budui avatar Apr 23 '24 02:04 budui

@budui Thanks for explanation. And this is what I think there should be tricks to generate the better synthetic captions or mixing short captions and synthetic long captions.

jyoung105 avatar May 02 '24 13:05 jyoung105

During ELLA's training, a large number of synthetic captions were used, which typically do not include names or character names. Therefore, if your prompt contains a name, ELLA's performance is very poor. You can try replacing the name with 'a woman' and concatenate the output of CLIP.

We want to prevent this issue on our end as well. What did you mean when you said "concatenate the output of CLIP"? Where would the name be introduced again?

andupotorac avatar Jun 14 '24 22:06 andupotorac