Alok Prasad
Alok Prasad
@shcheklein like the S3, Local , SSH support for Model and Data storage , Some organization also uses Sharepoint for Data storage Some links i found https://www.dmcinfo.com/latest-thinking/blog/id/9388/how-to-upload-a-file-to-sharepoint-on-premises-using-python
@JanX2 Do you have standalone repo of VAD from opus tree ?
@lissyx Sorry for late response here are the samples 1. trimmed 2. silence added to above trimmed file https://soundcloud.com/alok-prasad-213091558/sets/deepspeech-test-files actual utterance in the speech file is "why should one hold...
my usecase is wakeword + speech , where my system feeds (streaming) audio to deepspeech to detect wakeword as soon as it is detected next frame onwards it feeds audio...
@reuben amount of silence here is very small .not sure even after removing silence i above issue wont be resolved.Probably with Augmentation we have to chop few initial frames of...
@reuben , If silence is all zero deepspeech do not work , eg for utterance "Go back " - > Deepspeech gives "back" but if all zero silence of 100ms...
> I tested it with [Freesound Dataset Kaggle 2019](https://zenodo.org/record/3612637#.Xjq7OuEo9rk) which has about 103h of noise data. > Everything worked as intended. Only i didnt see a great difference in my...
@mychiux413 any idea how can be this done ? it should be online process ?
@mychiux413 anyway we can dump the mixed files and see how effective is the mixing of noise to speech file.just to make sure mixing is proper
@mychiux413 I also tried to save the audio using tf.print 's output_stream option in following function ``` "def augment_noise" noise_ratio = tf.math.pow(10.0, choosen_noise_db / 10) mixed_audio = tf.multiply(audio, audio_ratio) +...