VAD
VAD copied to clipboard
ACAM always detect badly on the start of a corpus
As the title said, I found the corpus at the beginning always be detected as non-speech. Can you explain it?
Hi, is there any silence in front of your sample, if not, the result may be not good. Because ACAM is context based model, there should be some samples to capture the speech context. Please send me your sample to [email protected] I'll debug it for you.
Thank you for your reply. And I had sent the test audio to your email.