silero-vad issues

8000Hz ONNX model❓ Questions / Help / Support

1

First, congratulations for your work and for sharing it with the community !! https://github.com/snakers4/silero-vad/releases/tag/v3.1 indicates > Only 16kHz available now (ONNX has some issues with if-statements and / or tracing...

nicolaspanel

help wanted

Changelog - V5 just released!

32

Just a handy issue to be notified of latest changes and micro-releases (we will mostly changing the models)

snakers4

documentation

New Features Poll

5

Thinking about new features, medium term: [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Higher%20accuracy)](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Higher%20accuracy/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Python%20packages)](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Python%20packages/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/8k%20support%20for%20ONNX)](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/8k%20support%20for%20ONNX/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Higher%20performance)](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Higher%20performance/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Wake%20word%20detection%20models)](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Wake%20word%20detection%20models/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Other%20model%20formats%20(please%20comment))](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Other%20model%20formats%20(please%20comment)/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Music%20detection%20for%20longer%20utterances%20(several%20seconds))](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Music%20detection%20for%20longer%20utterances%20(several%20seconds)/vote) [![](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Audio%20event%20classification%20for%20longer%20utterances%20(several%20seconds))](https://api.gh-polls.com/poll/01FRHQ0TATC7CTV8ES2SKA1RKW/Audio%20event%20classification%20for%20longer%20utterances%20(several%20seconds)/vote)

snakers4

Mobile / Edge / ARM / ONNX Use Cases

23

While the VAD (especially the micro one) was explicitly designed for IOT / edge / mobile use cases, we do not have the resource or expertise to provide instructions for...

snakers4

help wanted

About background vocals

1

Hi, I found the vad model is not very good at filtering background vocals , for example TV vocals. Are there parameters to adjust to filtering background vocals ？ Thanks.

moodeerf

help wanted

❓ Same .wav file but got different timestamps

1

## ❓ Questions and Help I found the speech timestamps respectively obtained from pytorch and silero-vad-onnx.cpp is somewhat different. The input file is 'en_example.wav' which downloaded from torch.hub. the speech...

DarrenChengdu

help wanted

❓ No onnx gpu support?

5

Hi, Just wondering is there no onnx gpu support? Would it not be any faster than jit when moving the model to CUDA with a .to() ? This is what...

CircuitCM

help wanted

v5

Feature request - Finetuning or Pretraining for Urdu

6

Please tell me if I can fine-tune this model on urdu dataset or train from scratch with same architecture for Urdu?? In short please elaborate the architecture of this model...

hunzlausman

enhancement

Bug report - Regression of VAD quality between 3.1 and 4.0 (speech detected on perfect silence)

7

## 🐛 Bug On some audio, the quality of the VAD is reallly worse in the latest version v4.0, compared to what it was in v3.1 More precisely, v4.0 detects...

Jeronymous

bug

v5

Feature request - [X]Please tell me if I can fine-tune this model on urdu dataset or train from scratch with same architecture for Urdu??

## 🚀 Feature ## Motivation ## Pitch ## Alternatives ## Additional context

hunzlausman

enhancement

silero-vad
silero-vad copied to clipboard

Metadata

8000Hz ONNX model❓ Questions / Help / Support

Changelog - V5 just released!

New Features Poll

Mobile / Edge / ARM / ONNX Use Cases

About background vocals

❓ Same .wav file but got different timestamps

❓ No onnx gpu support?

Feature request - Finetuning or Pretraining for Urdu

Bug report - Regression of VAD quality between 3.1 and 4.0 (speech detected on perfect silence)

Feature request - [X]Please tell me if I can fine-tune this model on urdu dataset or train from scratch with same architecture for Urdu??

← Metadata

Owner

Metadata

silero-vad silero-vad copied to clipboard

Metadata

← Metadata

Owner

Metadata

silero-vad
silero-vad copied to clipboard