Shreeram Chandra issues

Results: 5 issues of Shreeram Chandra

EOS token not predicted while training from scratch

I am currently training S1 from scratch as described in the paper as an ablation study. The paper states that the authors use a decoder-only architecture and a 12-layer...

Does the pre-trained model for hidden unit tokenizer use speaker embeddings?

Can you please elaborate on the role of speaker embeddings in the hidden unit tokenizer and what effect they have?

What is the time taken to converge for the hidden unit tokenizer?

I am currently training the hidden unit tokenizer to predict speech units from text token ids. Although the accuracy of the model continuously increases, I am unable to judge whether...

Link to train_960.tsv is broken

The link to the training data file seems to be broken: https://drive.google.com/file/d/1rxlikMglL2kEsF4NfqekZRoA02klY7CE/view?usp=sharing

Training text2vec

Thank you for putting up this code. I am interested in the txt2vec model (that you said works well in the other issue). Is the training stable? How long does...