Naga Sai Abhinay
Naga Sai Abhinay
The checkpoint merger pipeline based on the discussion at #877. Tested the checkpoint merger for the case with two checkpoints.
Added the code sample in the README.md
I'm training a new tokenizer on an Indic language, Tamil. I tried two different runs: Test run with part of the data used for training ~0.3Gb ``` from datasets import...
- [x] Add English text normlaization. - [x] Add WER calculations. - [x] Compare and check norm outputs from Python implementation - [x] Add WER to the regression tests once...
TODO: From Zack: > The tricky part is keeping track of timestamps as we iterate through, there can be drift > If we just do every 30s we’re going to...