
about silence tokens during inference

Open • thivux opened this issue 1 year ago • 1 comment

I see that the default values for silence_tokens during inference are [1388,1898,131]. My questions:

  1. Why is there more than one silence token?
  2. How do silence_tokens differ from the <SIL> phoneme in vocab.txt?
  3. How can I find the silence tokens when training on my own dataset?

thivux • Jul 03 '24 17:07

@thivux Have you figured it out? At the very least, how can one find the silence tokens?

royrs • May 08 '25 08:05
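
Not an official answer, but one possible way to look for candidate silence tokens on a custom dataset is to count which token IDs occur mostly inside stretches you already know are silent. The sketch below assumes (this is not confirmed anywhere in the thread) that silence_tokens are audio codec token IDs, and that you have already split your encoded frames into a "silent" list and a "speech" list, for example using alignment timestamps. The function name `candidate_silence_tokens` and its parameters are hypothetical and are not part of VoiceCraft.

```python
from collections import Counter


def candidate_silence_tokens(silent_frames, speech_frames, top_k=3, min_count=5):
    """Rank token IDs by how strongly they concentrate in silent regions.

    silent_frames: token IDs taken from frames known to be silent (assumed input)
    speech_frames: token IDs taken from frames known to contain speech (assumed input)
    """
    silent_counts = Counter(silent_frames)
    speech_counts = Counter(speech_frames)

    scored = []
    for token, n_silent in silent_counts.items():
        # Skip rare tokens: a handful of occurrences is not reliable evidence.
        if n_silent < min_count:
            continue
        # Fraction of this token's occurrences that fall inside silence.
        purity = n_silent / (n_silent + speech_counts[token])
        scored.append((purity, n_silent, token))

    # Highest purity first; ties are broken by how often the token occurs in silence.
    scored.sort(reverse=True)
    return [token for _, _, token in scored[:top_k]]


if __name__ == "__main__":
    # Toy data only; real inputs would come from your own encoded dataset.
    silent = [7] * 10 + [9] * 6 + [3] * 5
    speech = [3] * 20 + [1] * 30 + [9] * 1
    print(candidate_silence_tokens(silent, speech, top_k=2))
```

If several IDs come out with high purity, that would be consistent with the default list holding more than one value, but this is a heuristic and not necessarily how [1388,1898,131] were chosen.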