silero-vad
silero-vad copied to clipboard
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
## ๐ Bug Onnxruntime version 1.16 has been released yesterday. If I use it to load silero-vad using onnx=True, i get ```python ValueError: This ORT build has ['AzureExecutionProvider', 'CPUExecutionProvider'] enabled....
## ๐ Feature When we use get_speech_timestamps function we can assgin Parameters like _min_speech_duration_ms_,as I see it, they two actually are doing the same thing,does it mean that VADIterator can...
Hello! First of all thanks for the VAD model, it is great and really helpful! I've been doing some experiments with the 16 kHz ONNX models in order to establish...
in file examples/cpp/silero-vad-onnx.cpp, line 124 Do you mean "if (temp_end == 0)" ? Otherwise temp_end will never get non-zero values
## ๐ Bug An exception is thrown when attempting to create a session with silero_vad.onnx using winrt::Windows::AI::MachineLearning. It appears that silero_vad.onnx is using Opset 16, which I believe is not...
## ๐ Bug Hi, I am trying to batch the input to the pytorch model, but the batched input works for when the model is in cpu. But when the...
adding parallelism using ProcessPoolExecutor and concurrent.futures
## ๐ Bug It's not possible to convert the Silero VAD model to work with CoreML or with other conversion tools. ## To Reproduce Steps to reproduce the behavior: Create...
I have tried two Chinese speaker diarization data sets but their results are not good, especially when the human voice is removed as noise. Can this be fine-tuned๏ผ The code...
Dear members of the community, Finally, we are nearing the release of the `v5` version of the VAD. Can you please send your audio edge cases in this ticket so...