feed_forward_vqgan_clip

Feed-forward VQGAN-CLIP model, whose goal is to eliminate the need to optimize VQGAN's latent space separately for each input prompt
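The core idea fits in a short sketch. Below is a minimal, hypothetical illustration, not the repo's actual code; the class name, dimensions, and training description are assumptions. A small network maps a CLIP text embedding straight to a VQGAN latent grid, so generation becomes a single forward pass rather than a per-prompt optimization loop:

```python
# Hypothetical sketch of the feed-forward idea; NOT the repo's code.
# Class name, dimensions, and the loss described below are assumptions.
import torch
import torch.nn as nn

class PromptToLatent(nn.Module):
    """Maps a CLIP text embedding to a VQGAN latent grid in one forward pass."""
    def __init__(self, clip_dim=512, latent_channels=256, grid=16, hidden=1024):
        super().__init__()
        self.latent_channels = latent_channels
        self.grid = grid
        self.net = nn.Sequential(
            nn.Linear(clip_dim, hidden),
            nn.GELU(),
            nn.Linear(hidden, latent_channels * grid * grid),
        )

    def forward(self, text_emb):
        # text_emb: (batch, clip_dim) CLIP text embedding
        z = self.net(text_emb)
        return z.view(-1, self.latent_channels, self.grid, self.grid)

# Training idea: decode z with a frozen VQGAN decoder, re-embed the resulting
# image with frozen CLIP, and maximize cosine similarity with the text
# embedding. At inference, a single forward pass replaces the usual
# hundreds of per-prompt optimization steps.
```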

9 feed_forward_vqgan_clip issues

For lack of a better word, I've noticed during training that the VitGAN tends to get stuck on one, two, or three (I don't see four happen very often, or at all)...

Hi, first of all, great work! I really loved it. To replicate it, I tried training on the Conceptual 12M dataset with the same depth and dims as the pretrained models...

Thanks for your excellent repo. When training `cc12m_32x1024` with type `VitGAN` or `MLP Mixer`, what kind of GPU environment do you use? A Tesla V100 with 32 GB of memory, or something else? Thanks

It's quite easy to finetune one of the OpenAI CLIP checkpoints with this codebase: https://github.com/Zasder3/train-CLIP-FT It uses pytorch-lightning. May be worth pursuing.
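For context, the standard CLIP fine-tuning objective is a symmetric contrastive loss over image-text pairs. Here is a hedged sketch using OpenAI's `clip` package; the linked train-CLIP-FT repo wraps a similar loop in pytorch-lightning, and the batch inputs below are placeholders:

```python
# Sketch of the standard CLIP fine-tuning objective (symmetric contrastive
# loss). The data pipeline is a placeholder; hyperparameters are assumptions.
import torch
import torch.nn.functional as F
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)
# Note: on CUDA the OpenAI checkpoints load in fp16; real training code
# typically casts to fp32 or uses mixed precision.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-6)

def training_step(images, texts):
    # images: batch of `preprocess`-ed image tensors
    # texts: batch produced by clip.tokenize([...])
    image_features = F.normalize(model.encode_image(images), dim=-1)
    text_features = F.normalize(model.encode_text(texts), dim=-1)
    logit_scale = model.logit_scale.exp()
    logits = logit_scale * image_features @ text_features.t()
    labels = torch.arange(len(images), device=device)
    # Symmetric cross-entropy: match images to texts and texts to images.
    loss = (F.cross_entropy(logits, labels) +
            F.cross_entropy(logits.t(), labels)) / 2
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```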

Hi @mehdidc 👋🏼 I'm a new team member at @replicate. I was [trying out your model on replicate.ai](https://replicate.ai/mehdidc/feed_forward_vqgan_clip) and noticed that the names of the models are a bit cryptic,...

Hi! I really love this idea and think that this concept solves the main bottleneck of the current VQGAN+CLIP approach, which is the per-prompt optimisation. I love how instantaneous...

Katherine has released a better notebook for CLIP-guided diffusion. Output on a P100 is quite slow, but results can be very good. I've put the new notebook in my current...

Are you familiar with this: https://twitter.com/e08477/status/1418440857578098691?s=21 ? I want to do cityscape shots. Are you familiar with any relevant datasets? Can this repo help output higher-quality images? Or does...

I've been generating images using this model, which is delightfully fast, but I've noticed that it produces images that are all alike. I tried generating the "null" image by doing:...
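The snippet in that issue is truncated, but a "null" image experiment presumably means embedding an empty prompt with CLIP and feeding it to the generator. A hypothetical reconstruction, where `prompt_to_image` is only a stand-in for the repo's real entry point:

```python
# Hypothetical reconstruction of the "null image" experiment; the issue's
# actual snippet is truncated, and `prompt_to_image` is a stand-in for the
# repo's real generator entry point.
import clip
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
clip_model, _ = clip.load("ViT-B/32", device=device)

with torch.no_grad():
    tokens = clip.tokenize([""]).to(device)            # an empty prompt
    null_emb = clip_model.encode_text(tokens).float()
    # image = prompt_to_image(null_emb)  # stand-in for the repo's generator
```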