x-clip
A concise but complete implementation of CLIP, with various experimental improvements from recent papers.
PR for the distributed training setup.
Hi lucidrains, try this and it will NaN within 100 steps (latest GitHub code). The loss looks fine before the NaN.

```
import torch
torch.backends.cudnn.allow_tf32 = True
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.benchmark...
```
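As context for the report above, a minimal sketch of the TF32 settings it enables before training. This is an assumption about the reporter's full setup (the snippet is truncated); disabling these flags is a common first check when a contrastive loss suddenly NaNs, since TF32 trades matmul precision for speed.

```python
import torch

# TF32 flags as enabled in the report (the benchmark line is truncated there;
# `= True` is assumed here for illustration).
torch.backends.cudnn.allow_tf32 = True
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.benchmark = True

# To rule TF32 out as the cause of the NaN, flip both allow_tf32 flags to
# False and rerun; full float32 matmuls remove the reduced-precision path.
```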
Hi, nice work with x-clip. Hoping to play around with it and eventually combine it into your DALLE2 work. Currently having some trouble training on roughly 30k image-text pairs. Loss...
Will start with:
1. FILIP https://arxiv.org/abs/2111.07783
2. CLOOB https://arxiv.org/abs/2110.11316
3. https://arxiv.org/abs/2110.05208
Can we extract embeddings of size (say dim_text = 256, dim_image = 256) other than 512 from a pre-trained CLIP?
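On the question above, a frozen pre-trained CLIP emits embeddings of a fixed size (commonly 512), so smaller embeddings are usually obtained by learning a projection on top rather than by retraining. A minimal sketch, where the projection layers and the 512-dim stand-in tensors are illustrative assumptions, not part of x-clip's API:

```python
import torch
import torch.nn as nn

# Hypothetical projection heads mapping 512-dim CLIP outputs down to 256 dims.
text_proj = nn.Linear(512, 256, bias=False)
image_proj = nn.Linear(512, 256, bias=False)

# Stand-ins for the embeddings a pre-trained CLIP would produce
# (batch of 4, 512 dims each).
pretrained_text_emb = torch.randn(4, 512)
pretrained_image_emb = torch.randn(4, 512)

# 256-dim embeddings; the heads can be trained with the same contrastive
# objective while the backbone stays frozen.
text_emb_256 = text_proj(pretrained_text_emb)
image_emb_256 = image_proj(pretrained_image_emb)
```

Training a model from scratch with `dim_latent = 256` would avoid the extra head, but that gives up the pre-trained weights.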
Loss goes negative with mock data
Dear lucidrains, thanks for your selfless contribution and outstanding work; it has been very helpful. I'm an ML beginner, so my foundation is not solid. I have a question about x_clip. I'm...
Hi and thanks for all the work done in this repository! I noticed that the implementation of the CLS token in the Vision transformer, as well as the tokens used...