OpenDiT
OpenDiT copied to clipboard
Reproduction of DiT
Thank you for your great work, and I have several issues:
First, I think this work is definitely going to improve the DiT training more efficiently. But I wonder whether this codebase could reproduce the results in the original DiT paper, such as FID.
Second, I observed that global seed is used in the training of the original DiT implementation, as follows. In your implementation, global seed is not used in the train.py. Would this affect the reproducibility?
- We have verified our accuracy on ImageNet. And we can achieve similar generation qualiy as the origin DiT. I think thats enough for us. We are now using our framework to train text-to-video models, which is more valuable for the current time being.
- Seed will be always set in our framework when we launch.
@oahzxl Could you further add the FID you obtain and corresponding training steps with this repo into the README? I think that will be more attractive for people want to understand more about it. Thanks anyway, really good job.
currently all of our resources is used for training video generation models. so maybe in future we will update our results on FID
Thanks for your rapid reply. I still feel puzzled about the global seed in OpenDiT/train.py. As in the following figure, global_seed is only seen in add_argument(). How does this seed be set in your framework? If I misunderstand the usage, could you provide more instructions? Thank you.
it will be set in colossai.launch. thanks for your suggestion, i will pass this arg to our launch func in future pr
We have already enabled passing the global seed during launch time, please check the latest commit! Thanks for your patience!
Thanks for your attention and reply.