Make-A-Story
How much VRAM is needed for this?
This looks great!
Could you share some information on the setup you used to train the transformer model?
- how many GPUs / for how long
- how many steps
- what batch size
It would be helpful to have this information to better understand the cost of training these models.
- We used a single GPU (an A600).
- MUGEN is a large dataset, so it took longer to train; we trained for 3 epochs on it. For the other two datasets we trained for 100+ epochs.
- For MUGEN, Flintstones, and Pororo we used batch sizes of 24, 12, and 16 respectively.
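For anyone reproducing this, the per-dataset hyperparameters from the replies above could be summarized in a small config sketch. Only the epoch counts and batch sizes come from the answers; the dictionary structure and key names are hypothetical, not taken from the repo:

```python
# Hypothetical summary of the training setup described above.
# Epoch counts and batch sizes are quoted from the replies;
# "100+ epochs" is recorded here as 100. Everything else is illustrative.
TRAIN_CONFIG = {
    "mugen": {"epochs": 3, "batch_size": 24},        # large dataset, fewer epochs
    "flintstones": {"epochs": 100, "batch_size": 12},  # "100+ epochs"
    "pororo": {"epochs": 100, "batch_size": 16},       # "100+ epochs"
}

for name, cfg in TRAIN_CONFIG.items():
    print(f"{name}: {cfg['epochs']} epochs, batch size {cfg['batch_size']}")
```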