Aaron (Yinghao) Li

110 comments by Aaron (Yinghao) Li

@joe-none416 Why is it not good?

I'm gonna pin this in case someone else has similar problems. I don't know how to deal with this because I can't reproduce this problem with the oldest GPU I...

@ruby11dog could you please share the details to reproduce this problem? It seems it’s not related to the version of GPUs then?

So weird. I tried it in Colab (T4, V100, and A100) without specifying any library versions, and it works perfectly fine: https://colab.research.google.com/drive/1k5OqSp8a-x-27xlaWr2kZh_9F9aBh39K I'm really wondering what is the reason behind...

The repo so far is a research project, and it serves more as a proof of concept for the paper than as a full-fledged open source project. I agree...

Multilingual speech datasets are more difficult to get than language datasets. XPhoneBERT for example was trained entirely on Wikipedia in 100+ languages, but getting 100+ languages of speech data with...

If you want cross-lingual generalization, I think each language should have at least 100 hours of data. The data you provide is probably good for a single-speaker model, but not enough...

@hobodrifterdavid Thanks so much for your help. What you have now is probably good for multilingual PL-BERT training as long as you can keep this machine running for at least...

I think the GPUs provided by @hobodrifterdavid would be a great start for multilingual PL-BERT training. Before proceeding though, I need some people who speak as many languages as possible...

@SoshyHayami Thanks for your willingness to help. Fortunately, I think most other languages that have whitespace between words can be handled with the same logic. The only supported languages that...