Steffen Ohrendorf
Results: 132 comments of Steffen Ohrendorf
This looks like 2500 to me (or 50*50), unless I'm misunderstanding something. Some of the checkpoints need 1.5 hours or more (I guess it depends on the training data...
I don't have top-notch hardware; it's an RTX 2060 with 12 GB of VRAM. The batch size that maximizes VRAM usage is 8. Anything above that results in an OOM...