Tan Tran

Results 8 comments of Tan Tran

I may come a little late, so just comment here for anybody else encountering this issue later on. The original problem I guess is you are using more than 4...

You can start using the pretrained model on your own dataset to see how well it works. As the author mentioned, the pretrained is already trained on VoxCeleb1, VoxCeleb2 and...

I believe that there would be no optimal answer to this. You can start trying fine-tuning with all you got and see how well the model converge. If you don't...

@ngocanh2162 Khi làm việc với các mô hình `generative` cho `speech` thì các từ bị phát âm như bị khàn mình rất hay gặp nhưng nguyên nhân thì mỗi bài...

The average length is about 5s. In my observation, this model works well with utterance length ranging 5-7s and quite bad for the much shorter utterances (< 1s). I did...

@michaelgfeldman Glad that I could help! For the question, I actually have no idea. The only practical thing I could think of is just giving it a try and see...

Hi, I would love you to answer this in English so that any other people can reference it (though we're Vietnamese lol). For very short utterances, what I did was...

It's sorry that I don't have any experience with VAD or handling noise. But I think you can consider noise reduction/suppression/cancellation techniques, or whichever algorithms that cancel out the background...