PhoWhisper icon indicating copy to clipboard operation
PhoWhisper copied to clipboard

Is it possible to train the model with multi lingual languages ?

Open leviethung2103 opened this issue 11 months ago • 1 comments

Hi VinAI Team,

Given a audio that speaker mainly speaks 90% of time in Vietnamese, 10% of time in English. I've tested your model with this type of audio and English words are interpreted as Vietnamese language.

I am thinking of re-train the model with a dataset that contains the English and Vietnamese in transcript. Do you think this approach is feasible or not ?

Thank you

leviethung2103 avatar Mar 01 '24 10:03 leviethung2103