Abhi comments

Results: 23 comments by Abhi

4. Text -> semantic tokens modeling

@jpc I have a suggestion: there is already a [text-to-semantic](https://github.com/suno-ai/bark/blob/f6f2db527b13c4a3e52ed6fbac587aadc3723eb6/bark/api.py#L8) tokens pipeline in the Bark project. I wonder whether we can use it directly or take it as a reference.

Support for Hindi language

@gitmylo Thanks. I have trained for 15 epochs and am planning to go to 24. However, it took around 3 hours on a P100 GPU for just 15 epochs, and...

Support for Hindi language

@gitmylo I think the HuBERT base model doesn't support Hindi, because my generated output doesn't speak what the text prompt says; instead I get random words and noises. Given that, I...

Support for Hindi language

@gitmylo Yes, I just checked multiple wavs in the prepared folder (BTW, some files are pure noise too) and they sound perfect. Can you please suggest, from your experience, what shall...

Support for Hindi language

@gitmylo I searched a lot but could not find any. It would be nice if you could provide a link for it. P.S.: A point to be noted...

Support for Hindi language

@gitmylo I assume the problem is that the HuBERT base model doesn't support Hindi. I checked the generated semantic_prompt by converting it to wav form (semantic_to_waveform) as...

Support for Hindi language

Good news: I found a way of extracting semantic vectors from wav2vec models without the main hubert_base model.

Support for Hindi language

@gitmylo Hey, I have one question: why haven't you used the [hubert_base_ls960_L9_km500.bin](https://dl.fbaipublicfiles.com/hubert/hubert_base_ls960_L9_km500.bin) quantizer? And what's the reason for training for the English language?
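
For context on what such a quantizer does: a k-means quantizer maps each feature frame to the index of its nearest centroid, and that index serves as the semantic token. The sketch below is only an illustration with toy 2-dimensional data; `nearest_centroid`, `quantize`, and the centroid values are hypothetical names and numbers, not taken from the linked file or from any project's code.

```python
import math


def nearest_centroid(feature, centroids):
    """Return the index of the centroid closest to `feature` (squared Euclidean distance)."""
    best_index, best_dist = -1, math.inf
    for index, centroid in enumerate(centroids):
        dist = sum((f - c) ** 2 for f, c in zip(feature, centroid))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def quantize(features, centroids):
    """Map every feature frame to a token id (the index of its nearest centroid)."""
    return [nearest_centroid(frame, centroids) for frame in features]


# Toy data (hypothetical): 3 centroids and 3 feature frames in 2 dimensions.
# A real quantizer would use many more clusters and much wider feature vectors.
toy_centroids = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
toy_features = [[0.1, -0.1], [0.9, 1.2], [0.2, 0.8]]

tokens = quantize(toy_features, toy_centroids)
print(tokens)
```

Because the centroids are fitted on the features of one specific model and layer, tokens are only meaningful when the incoming features come from a matching extractor.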

Support for Hindi language

@gitmylo Thanks, just one last question: is it necessary to pass an input of size 768 to the tokenizer? I mean, can we pass an input of 1024 or something like...

Support for Hindi language

Thanks. Can you please shed light on the rate? I mean, what is the required rate?

> Make sure the Wav2Vec extracts features at the same rate as HuBERT...
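
One way to reason about the feature rate: it is the audio sample rate divided by the total stride of the convolutional feature encoder. The sketch below assumes the stride pattern (5, 2, 2, 2, 2, 2, 2) used by the published base configurations of wav2vec 2.0 and HuBERT, and 16 kHz input; both are assumptions to verify against the actual checkpoint, and `frame_rate_hz` and `same_rate` are hypothetical helper names.

```python
from math import prod

# Assumption: conv feature-encoder strides of the base wav2vec 2.0 / HuBERT configs.
BASE_CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)


def frame_rate_hz(sample_rate, strides=BASE_CONV_STRIDES):
    """Feature frames per second = sample rate / product of the conv strides."""
    return sample_rate / prod(strides)


def same_rate(sample_rate_a, strides_a, sample_rate_b, strides_b):
    """Check whether two feature extractors emit frames at the same rate."""
    return frame_rate_hz(sample_rate_a, strides_a) == frame_rate_hz(sample_rate_b, strides_b)


rate = frame_rate_hz(16_000)  # total stride 320 samples, i.e. one frame per 20 ms
print(rate)
```

Under these assumptions both models emit one frame per 320 samples. A model with a different stride pattern or input sample rate would need resampling or frame interpolation before its features line up.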