MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Results 17 MDT issues

Thanks again for the wonderful work. I still want to run some experiments on small models such as B/2 or S/2 because of my limited resources. Will you guys...

Dear Authors, when I train the model on one GPU with `python -m torch.distributed.launch --nproc_per_node=1 scripts/image_train.py` and on 4 GPUs with `python -m torch.distributed.launch --nproc_per_node=4 scripts/image_train.py`...

Hello! When training with one GPU, I encountered a problem: ![image](https://github.com/sail-sg/MDT/assets/163538949/a8db18e6-3252-4911-8d4e-10ca2ff531d1) I set up the script like this: ![image](https://github.com/sail-sg/MDT/assets/163538949/64eaff55-a8d6-4b82-91e4-a53cf7d6f311) I don't know which leaf Variable that requires...

Great work! However, I wonder whether the reconstruction loss (MSE loss) from MAE is missing from the code; I couldn't find it. Does MDT compute the noising prediction...

Hello, I would like to know how this sentence in the paper is achieved: "In MDT, both the full and remaining unmasked latent embeddings are fed into the diffusion model...

Changed hugglingface to huggingface

Judging from the mse and m_mse losses, the mask branch does not seem to work in MDT-S-2. We also visualized the generated images and found that generated images with...

The paper states a batch size of 256. Why is the batch size 32 in run.sh, and why is it 4 in run_ddp_master.sh?

Hello everyone. I would like to know about the memory requirements of the model. I am trying to run the original model on 2 48GB GPUs through PyTorch DDP. However,...