Clay Mullis

Results 40 issues of Clay Mullis

Hey! Great work on the original notebook. @mehdidc is working on a similar project now as well. I recommended your original CLIP Decision Transformer notebook; and came here to find...

Having trouble debugging this; but I think after looking at the code briefly - there's not any webdataset retrieval I dont think? The root problem is that I am not...

enhancement

Laion.ai recently has been donated access to a few 8xA100 pods, of which I'm able to use. Tinkering with using `cog` for inference on these is tricky because it is...

(disclaimer): this is code for training a _custom_ CLIP from the repository here, not the one in the OpenAI repo. For something like that I recommend open_clip. There are valid...

https://github.com/lucidrains/DALLE-pytorch/discussions/139#discussioncomment-560790 It appears as though adamw does work better but the weight decay is creating strange generations. Getting the same strange "brown" generations even though the loss continues to go...

https://gist.github.com/afiaka87/b29213684a1dd633df20cab49d05209d If there are any bugs - please make a comment below. When in doubt; restart your kernel. Tends to fix things a lot.

### Discussed in https://github.com/lucidrains/DALLE-pytorch/discussions/339 ![photooftheflag](https://user-images.githubusercontent.com/3994972/126043154-50e0aa44-4780-4c28-b722-57aa7ab6a840.png) ![an_illustration](https://user-images.githubusercontent.com/3994972/126043155-fbc22d4a-ac27-4160-9db2-3bb11006a87e.png) Originally posted by **afiaka87** July 17, 2021 I've been training a DALL-e with the goal of seeing whether or not a caption could be...

### Discussed in https://github.com/lucidrains/DALLE-pytorch/discussions/335 Originally posted by **afiaka87** July 11, 2021 [Full W&B training session](https://wandb.ai/dalle-pytorch-replicate/COCO512_16_16D_16H_80TSL) ![media_images_image_14100_c563c7f9470a4a3dd2c2](https://user-images.githubusercontent.com/3994972/125195299-b1814c00-e21a-11eb-8642-4e010dd8d113.png) ![media_images_image_14500_d5fdc93c3d9bba882b25](https://user-images.githubusercontent.com/3994972/125195303-b3e3a600-e21a-11eb-9b69-a3da7b075875.png) ![coco_trained](https://user-images.githubusercontent.com/3994972/125195424-248ac280-e21b-11eb-8231-cd9cede6d549.png) Details Transformer: - Visual Dim - 512 - Max Text Length/Language Dim...

In the cogview paper they claim that by giving the text as much importance they achieve a better result. They "hypothesize" that this is because the transformer is learning both...