Naga Sai Abhinay

Results 5 issues of Naga Sai Abhinay

The checkpoint merger pipeline based on the discussion at #877. Tested the checkpoint merger for the case with two checkpoints.

Added the code sample in the README.md

I'm training a new tokenizer on an Indic language, Tamil. I tried two different runs: Test run with part of the data used for training ~0.3Gb ``` from datasets import...

- [x] Add English text normlaization. - [x] Add WER calculations. - [x] Compare and check norm outputs from Python implementation - [x] Add WER to the regression tests once...

enhancement

TODO: From Zack: > The tricky part is keeping track of timestamps as we iterate through, there can be drift > If we just do every 30s we’re going to...