vosk-api

Newbie: Having trouble formatting my dataset. I want to perform language adaptation: https://alphacephei.com/vosk/adaptation

Open moodpanda opened this issue 1 month ago • 3 comments

I just want to ask whether it is possible to use my dataset. It is a CSV file with two columns: audio and transcription.

Thanks for answering.

moodpanda avatar May 11 '24 03:05 moodpanda

Up, I have the same question.

juliustuliao avatar May 13 '24 08:05 juliustuliao

This question is too vague for me to answer. If you need help, you need to provide the details: what problem you are trying to solve, what data you have, and so on.

nshmyrev avatar May 13 '24 08:05 nshmyrev

I want the model to recognize new words or sentences using my own dataset. Currently, each entry in my dataset contains the path to an audio file and its corresponding transcription. I am new to Kaldi and unsure how to format my data properly in order to perform model adaptation.

My dataset example:

audio_file_path, transcription
data/chunk_001.wav, hello, world

moodpanda avatar May 13 '24 08:05 moodpanda

Vosk models are adapted with just text, not the audio + text.

What is the language of your dataset? What models did you try? What is the current accuracy of the model?
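Since adaptation uses text only, the transcription column of a CSV like the one described above could be pulled out into a plain-text corpus, one sentence per line. The following is only a minimal sketch under assumptions not confirmed in this thread: the function name `csv_to_corpus` is hypothetical, the header is assumed to be `audio_file_path, transcription`, everything after the first comma is treated as the transcription (because the transcription itself may contain commas), and the lowercasing and punctuation stripping are an assumed normalization, not a documented Vosk requirement.

```python
import re

# Sample rows in the format shown earlier in the thread.
SAMPLE = """audio_file_path, transcription
data/chunk_001.wav, hello, world
"""

def csv_to_corpus(csv_text):
    """Return a list of normalized transcriptions, one per input row."""
    corpus = []
    for raw in csv_text.splitlines():
        raw = raw.strip()
        if not raw:
            continue
        # Split on the FIRST comma only: the transcription may contain commas.
        path, sep, text = raw.partition(",")
        if not sep:
            continue  # no comma, so no transcription on this row
        if path.strip() == "audio_file_path":
            continue  # skip the (assumed) header row
        # Assumed normalization: lowercase, drop punctuation except
        # apostrophes and hyphens, collapse runs of whitespace.
        text = re.sub(r"[^\w\s'-]", " ", text.lower())
        text = " ".join(text.split())
        if text:
            corpus.append(text)
    return corpus

corpus = csv_to_corpus(SAMPLE)
print("\n".join(corpus))
```

The resulting lines can be written to a text file and used as the text input for language model adaptation; the audio paths are not needed for that step.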

nshmyrev avatar May 13 '24 10:05 nshmyrev

My dataset language is Filipino, and I am trying to adapt the model vosk-model-tl-ph-generic-0.6 to add new words or sentences to its vocabulary.

moodpanda avatar May 13 '24 13:05 moodpanda

@moodpanda this model is precompiled and we cannot modify it. You have to contact Fed directly for an update: https://github.com/feddybear/flipside_ph. Only he can do it.

nshmyrev avatar May 13 '24 13:05 nshmyrev

Got it, thank you very much. I will try to email him if he is still active. Thank you.

moodpanda avatar May 13 '24 13:05 moodpanda

He certainly can help you. Best.

nshmyrev avatar May 13 '24 13:05 nshmyrev