seamless_communication icon indicating copy to clipboard operation
seamless_communication copied to clipboard

Finetuning for ASR and the dataset preparation

Open Gxhappiness opened this issue 1 year ago • 3 comments

Hi, I want to finetune the ASR on the custom dataset, so 2 issues have arisen: 1.How can I do the finetuning for ASR? Is it possible to make modifications only on finetune.py? How can I make modifications? 2.What format should I prepare the dataset? What is the content of manifest.json? Can anyone provide the specific content of the manifest.json?

Gxhappiness avatar Jan 11 '24 01:01 Gxhappiness

{"source": {"id": 1806, "lang": "eng", "text": "", "audio_local_path": "path to .wav", "waveform": null, "sampling_rate": 16000, "units": null}, "target": {"id": 1806, "lang": "urd", "text": "", "audio_local_path": "path to 491841998166793263.wav", "waveform": null, "sampling_rate": 16000, "units": null}}

write

adnankarim avatar Jan 18 '24 04:01 adnankarim

{"source": {"id": 1806, "lang": "eng", "text": "", "audio_local_path": "path to .wav", "waveform": null, "sampling_rate": 16000, "units": null}, "target": {"id": 1806, "lang": "urd", "text": "", "audio_local_path": "path to 491841998166793263.wav", "waveform": null, "sampling_rate": 16000, "units": null}}

write

thx

Gxhappiness avatar Jan 18 '24 04:01 Gxhappiness

can you share notebook which are helpful for the ASR custom finetuning

MuhammadWaqarSahi avatar Feb 27 '24 11:02 MuhammadWaqarSahi