Thiago Soares Laitz
Results
2
comments of
Thiago Soares Laitz
@A9isha Hello, do you by any chance have a script that does the opposite, converting HF to Orbax?
I'm training bigger models than before, so I can't use the same batch size on the same TPU. Got any recommended ablation studies on using gradient accumulation versus lowering the...