Abhi
Abhi
@jpc I have a suggestion, There is already a [text to semantic](https://github.com/suno-ai/bark/blob/f6f2db527b13c4a3e52ed6fbac587aadc3723eb6/bark/api.py#L8) tokens pipeline in project Bark. I wonder, can we directly use it or take it as a reference
@gitmylo Thanks, I have trained for 15 epochs , and planning to do for 24. However it took around 3 hours on a P100 GPU for just 15 epochs. and...
@gitmylo i think, Hubert base model doesn't support hindi language because my generated text doesn't speaks what's prompted with text , instead some random words and noises. Given that, I...
@gitmylo yes, i just checked multiple wavs (BTW some files are pure noises too) in prepared folder and they sounds perfect. Can you please suggest by your experience what shall...
@gitmylo , I could not found any searched a lot, it would be nice , if you may provide one link for it. P.S: And a point to be noted...
@gitmylo I assume there is problem with hubert base model doesn't supports hindi , as i checked with the generated semantic_prompt , i converted them to wav form (sematic_to_waveform) as...
A good news, i found a way of extracting semantic vectors from wav2vec models without the main hubert_base model.
@gitmylo hey, i have one doubt , why haven't you used the [hubert_base_ls960_L9_km500.bin](https://dl.fbaipublicfiles.com/hubert/hubert_base_ls960_L9_km500.bin) quantizer ? And what's the reason of training for english language ??
@gitmylo thanks, just one last question, Is it necessary to pass a input of size 768 to tokenizer, i mean that can we pass input of 1024 or something like...
Thanks, Can you please shed lights on rate ? i mean what is the required rate ? > Make sure the Wav2Vec extracts features at the same rate as HuBERT...