GigaAM icon indicating copy to clipboard operation
GigaAM copied to clipboard

Train from scratch on the datasets used for finetune

Open vadimkantorov opened this issue 8 months ago • 0 comments

Hi!

Curious, do you provide baselines/checkpoints where you train from scratch on Golos+Sova+RCV+RLS including some models like FastConformer (hybrid CTC+RNNT)?

It would be helpful repro baselines, given that nvidia does not provide full training/data prep scripts for their public FastConformer models, and this baseline can probably be run without a ton of resources and still be useful for experimenting with the model architecture (e.g. positional encoding used)

Thanks!

vadimkantorov avatar Jun 18 '24 20:06 vadimkantorov