latent-diffusion-segmentation
What is the effect of the VAE?
How well does directly generating the (bit-encoded) panoptic mask, conditioned on an image, perform?
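For reference, here is a minimal sketch of what a bit-encoded mask could look like. It assumes an "analog bits" style encoding, where each integer ID is written as binary digits scaled to {-1, +1}; the notes above do not say which encoding is meant. The function names `mask_to_bits` and `bits_to_mask` and the `num_bits` parameter are hypothetical and not taken from this repository.

```python
import numpy as np


def mask_to_bits(mask, num_bits=8):
    """Encode an integer ID mask (H, W) as analog bits (H, W, num_bits) in {-1, +1}.

    Assumption: IDs are non-negative and smaller than 2**num_bits.
    """
    mask = np.asarray(mask, dtype=np.int64)
    if mask.min() < 0 or mask.max() >= 2 ** num_bits:
        raise ValueError("IDs must lie in [0, 2**num_bits)")
    shifts = np.arange(num_bits, dtype=np.int64)  # least significant bit first
    bits = (mask[..., None] >> shifts) & 1        # (H, W, num_bits), values in {0, 1}
    return bits.astype(np.float32) * 2.0 - 1.0    # rescale to {-1, +1}


def bits_to_mask(bits):
    """Decode (possibly noisy) analog bits back to integer IDs by thresholding at 0."""
    bits = np.asarray(bits)
    hard = (bits > 0).astype(np.int64)
    weights = 1 << np.arange(bits.shape[-1], dtype=np.int64)
    return (hard * weights).sum(axis=-1)


if __name__ == "__main__":
    toy_mask = np.array([[0, 3], [17, 255]])
    encoded = mask_to_bits(toy_mask)
    print(encoded.shape)
    print(bits_to_mask(encoded))
```

Under this assumption, a diffusion model would generate the real-valued bit channels, and thresholding at zero would recover discrete IDs even when the generated values are not exactly -1 or +1.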