Vineel Pratap comments

Results 41 comments of


                                            Vineel Pratap

Update README.md

Hi, this is not a bug. You can pass multiple audio files in the command.

hypo.word file missing during MMS ASR inference

Hi, can you share the entire log? I just tested the code again and it works fine from my end.

hypo.word file missing during MMS ASR inference

@audiolion We expect a 3-digit language code. See 'Supported languages' section in README file for each model. For example - use 'eng' for English.

hypo.word file missing during MMS ASR inference

@shsagnik `No module named 'editdistance'` - You should install the missing module.

MMS - Forced Alignment: Array shapes problem

Hi, can you change `torch.cat(emissions_arr, dim=1)` --> `torch.cat(emissions_arr, dim=-1)` in `align_and_segment.py` file. I'll send a PR to fix the code soon.

MMS - Forced Alignment: Array shapes problem

Hi, I just landed the fix in https://github.com/facebookresearch/fairseq/pull/5133. Please use the updated code.

How to convert mms model to hf model?

Hi, MMS uses Transformer layer with an additional adapter module which is not used on original wav2vec2.0. See - https://github.com/facebookresearch/fairseq/blob/main/fairseq/models/wav2vec/wav2vec2.py#L978, https://github.com/facebookresearch/fairseq/blob/main/examples/speech_recognition/new/infer.py#L108 You would have to make appropriate changes in your...

Vineel Pratap

Update README.md

hypo.word file missing during MMS ASR inference

hypo.word file missing during MMS ASR inference

hypo.word file missing during MMS ASR inference

MMS - Forced Alignment: Array shapes problem

MMS - Forced Alignment: Array shapes problem

How to convert mms model to hf model?

Poor performance in Chinese

Poor performance in Chinese

Poor performance in Chinese