Alexandru Papiu
@albertfgu I just used learned positional encodings - the sequence length was 64 (a 32 by 32 image split into 4 by 4 patches gives an 8 by 8 grid, i.e. 64 tokens). I will try to reproduce the results...
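For context, a minimal sketch of learned positional encodings over those 64 patch tokens (names and dimensions are illustrative, not the exact repo code):

```python
import torch
import torch.nn as nn

class PatchEmbedding(nn.Module):
    """Illustrative patch embedding with learned positional encodings.

    A 32x32 image split into 4x4 patches gives an 8x8 grid = 64 tokens.
    """
    def __init__(self, embed_dim=256, num_patches=64, patch_dim=4 * 4 * 3):
        super().__init__()
        self.proj = nn.Linear(patch_dim, embed_dim)
        # one learned vector per position, trained jointly with the rest of the model
        self.pos_emb = nn.Parameter(torch.zeros(1, num_patches, embed_dim))

    def forward(self, patches):  # patches: (batch, 64, patch_dim)
        return self.proj(patches) + self.pos_emb
```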
Hey @metatl, try using the legacy_dh_order branch - the model was trained with a small but annoying difference in the ordering of the hidden dimensions and head dimensions, and unfortunately I...
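For anyone hitting the same thing, the kind of mismatch being described is roughly the following (a hypothetical illustration, not the exact code on either branch): whether the flat hidden dimension is split as (heads, head_dim) or (head_dim, heads) changes how checkpoint weights are interpreted, so a model trained with one ordering loads incorrectly under the other.

```python
import torch

batch, seq_len, n_heads, head_dim = 2, 64, 8, 32
x = torch.randn(batch, seq_len, n_heads * head_dim)

# ordering A: split the hidden dim as (heads, head_dim)
qa = x.view(batch, seq_len, n_heads, head_dim).transpose(1, 2)

# ordering B: split the hidden dim as (head_dim, heads)
qb = x.view(batch, seq_len, head_dim, n_heads).permute(0, 3, 1, 2)

# both end up (batch, heads, seq_len, head_dim), but they slice the hidden
# dimension differently, so weights trained one way are scrambled the other way
print(qa.shape, qb.shape, torch.allclose(qa, qb))  # same shapes, different values
```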
Hey @aabzaliev - interesting results. Agreed the results aren't great (but kind of interesting too). I think it could be a few things. 1. The noise distribution - the defaults...
Hey, the model uses torch.nn.functional.scaled_dot_product_attention (https://pytorch.org/docs/stable/generated/torch.nn.functional.scaled_dot_product_attention.html), which should already use Flash Attention when it's available.
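Roughly, the attention call looks like this (a simplified sketch, not the repo's exact module; PyTorch dispatches to the Flash Attention kernel automatically when the inputs and hardware support it):

```python
import torch
import torch.nn.functional as F

batch, n_heads, seq_len, head_dim = 2, 8, 64, 32
q = torch.randn(batch, n_heads, seq_len, head_dim)
k = torch.randn_like(q)
v = torch.randn_like(q)

# PyTorch picks the fastest available backend for these inputs
# (Flash Attention, memory-efficient attention, or the math fallback);
# the flash kernel itself only kicks in for fp16/bf16 tensors on CUDA.
out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)  # (batch, n_heads, seq_len, head_dim)
```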
Hi @ericwudocomoi, I added a [notebook](https://colab.research.google.com/drive/1sKk0usxEF4bmdCDcNQJQNMt4l9qBOeAM) in the https://github.com/apapiu/transformer_latent_diffusion?tab=readme-ov-file#usage section: if you look in the notebook, it will download some already preprocessed data, including the val encodings, and do a...
@ericwudocomoi Ok, I added another notebook in the https://github.com/apapiu/transformer_latent_diffusion?tab=readme-ov-file#usage subsection that should help you preprocess the images and text for your own dataset. Let me know if this helps and you're...
Yes it's a Keras version error. One solution is to save the model weights and then use the load_weights method on a newly instantiated model with the same architecture. I...
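A minimal sketch of that approach, assuming tf.keras (build_model here is just a stand-in for however you construct your architecture):

```python
import numpy as np
from tensorflow import keras

def build_model():
    # stand-in for your own model-construction code
    return keras.Sequential([
        keras.Input(shape=(8,)),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(1),
    ])

# old environment: save only the weights, not the full serialized model
model = build_model()
model.save_weights("model.weights.h5")

# new Keras version: rebuild the same architecture in code, then load the weights
new_model = build_model()
new_model.load_weights("model.weights.h5")

x = np.zeros((1, 8), dtype="float32")
assert np.allclose(model.predict(x), new_model.predict(x))
```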
Hi - what dataset are you trying to use? For the text embedding you can use the get_text_encodings function and the images can just be resized to the appropriate size...
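If it helps, here is a generic sketch of the two preprocessing steps, with the text side done directly with a CLIP model from Hugging Face transformers as a stand-in for get_text_encodings (the repo helper may use a different checkpoint or pooling, so treat the model name and shapes as assumptions and check the preprocessing notebook):

```python
import numpy as np
import torch
from PIL import Image
from transformers import CLIPModel, CLIPTokenizer

# images: just resize to the resolution the model expects
# (256x256 here is an assumption - use whatever your config says)
img = Image.open("my_image.jpg").convert("RGB").resize((256, 256))
img_array = np.array(img)

# text: encode captions into CLIP text embeddings, which is roughly what a
# helper like get_text_encodings does under the hood
tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
clip = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")

tokens = tokenizer(["a photo of a cat"], padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    text_emb = clip.get_text_features(**tokens)  # shape (1, 768)
```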
Hey! The speedup happens in the next line: `x0_pred = self.denoiser.predict(nn_inputs, batch_size=self.batch_size)`. Here we only have to call .predict once on the concatenated matrix, which is faster than calling .predict...
`x0_pred_label` is the prediction conditioned on the text embedding and `x0_pred_no_label` is the unconditional prediction (where the text embedding input is all zeros).
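In other words, the classifier-free-guidance step stacks the conditional and zeroed-text inputs into one batch, runs a single .predict, and then splits the result. A simplified sketch with a dummy Keras denoiser (the real model's inputs and shapes differ):

```python
import numpy as np
from tensorflow import keras

# dummy denoiser taking (noisy_latents, text_emb) -> x0 prediction;
# purely a stand-in so the batching logic below is runnable
latent_in = keras.Input(shape=(16, 16, 4))
text_in = keras.Input(shape=(768,))
h = keras.layers.Concatenate()([keras.layers.Flatten()(latent_in), text_in])
x0_out = keras.layers.Reshape((16, 16, 4))(keras.layers.Dense(16 * 16 * 4)(h))
denoiser = keras.Model([latent_in, text_in], x0_out)

batch_size = 8
noisy = np.random.randn(batch_size, 16, 16, 4).astype("float32")
text_emb = np.random.randn(batch_size, 768).astype("float32")

# stack conditional and unconditional (zeroed text embedding) copies so the
# network only has to be called once on a 2x-sized batch
nn_inputs = [
    np.concatenate([noisy, noisy], axis=0),
    np.concatenate([text_emb, np.zeros_like(text_emb)], axis=0),
]
x0_pred = denoiser.predict(nn_inputs, batch_size=batch_size)

# split back into the two predictions and mix them with the guidance scale
x0_pred_label, x0_pred_no_label = x0_pred[:batch_size], x0_pred[batch_size:]
guidance = 3.0
x0_pred_guided = x0_pred_no_label + guidance * (x0_pred_label - x0_pred_no_label)
```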