Clay Mullis issues

Results: 40 issues of Clay Mullis

Similar ideas

Hey! Great work on the original notebook. @mehdidc is working on a similar project now as well. I recommended your original CLIP Decision Transformer notebook and came here to find...

`clip-retrieval filter` webdataset support

Having trouble debugging this, but after looking at the code briefly, I don't think there's any webdataset retrieval? The root problem is that I am not...

enhancement

Specifying GPU rank on multi-GPU systems

Laion.ai was recently donated access to a few 8xA100 pods, which I'm able to use. Tinkering with using `cog` for inference on these is tricky because it is...
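One common way to pin a process to a single GPU on a multi-GPU machine is the `CUDA_VISIBLE_DEVICES` environment variable. The sketch below is only an illustration of that general technique; it does not reflect how `cog` itself selects devices, and the helper name `env_for_gpu` is hypothetical.

```python
import os


def env_for_gpu(gpu_index, base_env=None):
    """Return a copy of the environment that exposes only one GPU.

    `env_for_gpu` is a hypothetical helper, not part of `cog` or any library.
    """
    if gpu_index < 0:
        raise ValueError("gpu_index must be non-negative")
    # Copy so the caller's (or the process's) environment is not mutated.
    env = dict(os.environ if base_env is None else base_env)
    # CUDA applications launched with this environment see only this device.
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)
    return env


# Example: build an environment that exposes only GPU 3.
env = env_for_gpu(3, base_env={"PATH": "/usr/bin"})
```

The returned dictionary can be passed as the `env` argument of `subprocess.run` when launching an inference process, so that each process on an 8-GPU pod sees a different device.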

Image generation with deepspeed --fp16

Train a custom CLIP with DeepSpeed CPU offload, 16 bit precision

(disclaimer): this is code for training a _custom_ CLIP from the repository here, not the one in the OpenAI repo. For something like that I recommend open_clip. There are valid...

"adamw" optimizer + weight decay = poor generations

https://github.com/lucidrains/DALLE-pytorch/discussions/139#discussioncomment-560790 It appears as though adamw does work better but the weight decay is creating strange generations. Getting the same strange "brown" generations even though the loss continues to go...

(colab notebook) Train DALLE-pytorch on C@H

https://gist.github.com/afiaka87/b29213684a1dd633df20cab49d05209d If there are any bugs, please leave a comment below. When in doubt, restart your kernel; that tends to fix a lot of things.

Getting Text to Output a Font/Handwriting of the Same Text

### Discussed in https://github.com/lucidrains/DALLE-pytorch/discussions/339
![photooftheflag](https://user-images.githubusercontent.com/3994972/126043154-50e0aa44-4780-4c28-b722-57aa7ab6a840.png) ![an_illustration](https://user-images.githubusercontent.com/3994972/126043155-fbc22d4a-ac27-4160-9db2-3bb11006a87e.png)
Originally posted by **afiaka87** July 17, 2021
I've been training a DALL-e with the goal of seeing whether or not a caption could be...

20 Epochs on COCO - (Larger Transformer)

### Discussed in https://github.com/lucidrains/DALLE-pytorch/discussions/335
Originally posted by **afiaka87** July 11, 2021
[Full W&B training session](https://wandb.ai/dalle-pytorch-replicate/COCO512_16_16D_16H_80TSL)
![media_images_image_14100_c563c7f9470a4a3dd2c2](https://user-images.githubusercontent.com/3994972/125195299-b1814c00-e21a-11eb-8642-4e010dd8d113.png) ![media_images_image_14500_d5fdc93c3d9bba882b25](https://user-images.githubusercontent.com/3994972/125195303-b3e3a600-e21a-11eb-9b69-a3da7b075875.png) ![coco_trained](https://user-images.githubusercontent.com/3994972/125195424-248ac280-e21b-11eb-8231-cd9cede6d549.png)
Details
Transformer:
- Visual Dim - 512
- Max Text Length/Language Dim...

CogView Think Image and Text Should be weighted the same

In the CogView paper, they claim that giving the text as much importance as the image achieves a better result. They "hypothesize" that this is because the transformer is learning both...
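To make the weighting question concrete, here is a minimal sketch of combining a text loss and an image loss with an adjustable image weight. The weighted-average form, the function name `combined_loss`, and the numbers are assumptions chosen for illustration; they are not taken from the CogView paper.

```python
def combined_loss(text_loss, image_loss, image_weight=1.0):
    """Weighted average of the text and image losses.

    With image_weight == 1.0 both modalities count equally; larger values
    shift the training signal toward the image tokens. (Hypothetical helper.)
    """
    if image_weight < 0:
        raise ValueError("image_weight must be non-negative")
    return (text_loss + image_weight * image_loss) / (image_weight + 1.0)


# Illustrative values only: text loss 2.0, image loss 4.0.
equal = combined_loss(2.0, 4.0, image_weight=1.0)        # both weighted the same
image_heavy = combined_loss(2.0, 4.0, image_weight=7.0)  # image dominates
```

With equal weights the combined value sits midway between the two losses; as `image_weight` grows, it approaches the image loss and the text contributes less to the gradient.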