self-supervised-speech-recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Results: 29 self-supervised-speech-recognition issues

Hi there, I am following the instructions in the 'Install instruction' section, but when I run ```!pip install -e .``` I get the following error: ``` file:///content/self-supervised-speech-recognition/libs/wav2letter/bindings/python Installing collected packages:...

Given that I'm training on 5 GeForce GTX 1080 Ti GPUs with 10.917 GB of memory each, how can I calculate max_tokens so that no memory error occurs?
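The thread gives no closed-form answer, but it may help to see what max_tokens measures. The following is a minimal sketch, assuming that max_tokens counts raw audio samples per GPU batch (as in fairseq's wav2vec recipes) and that the audio is sampled at 16 kHz; both are assumptions to check against your own config. The helper names are hypothetical, and the shrinking schedule is only a generic trial-and-error strategy, not a memory formula.

```python
SAMPLE_RATE_HZ = 16_000  # assumption: 16 kHz audio


def max_tokens_to_seconds(max_tokens, sample_rate=SAMPLE_RATE_HZ):
    """Seconds of audio per GPU batch implied by a max_tokens value."""
    return max_tokens / sample_rate


def seconds_to_max_tokens(seconds, sample_rate=SAMPLE_RATE_HZ):
    """max_tokens value that holds roughly `seconds` of audio per GPU batch."""
    return int(seconds * sample_rate)


def shrinking_candidates(start, factor=0.8, steps=5):
    """Descending max_tokens values to try, one by one, after a memory error."""
    return [int(start * factor ** i) for i in range(steps)]


if __name__ == "__main__":
    print(max_tokens_to_seconds(1_600_000))            # seconds per batch
    print(shrinking_candidates(1_000_000, factor=0.5, steps=3))
```

In practice one would start from a value that works on a larger card, then step down through the candidates until a training run on the 1080 Ti no longer fails.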

I was wondering why in the finetune.py file you've set update_freq to be 24/NUM_GPU. ``` cmd.append("+optimization.update_freq='[" + str(int(24/NUM_GPU)) + "]'") ``` In the wav2vec Readme https://github.com/pytorch/fairseq/blob/master/examples/wav2vec/README.md they say that the...
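For readers following this question, here is a small sketch of what the quoted line computes. It assumes that update_freq acts as a gradient-accumulation factor, so that NUM_GPU * update_freq approximates a fixed number of simulated GPUs (24 in the quoted code). The function names are hypothetical and not part of finetune.py.

```python
def update_freq_for(num_gpu, target_gpus=24):
    """Mirror of int(24/NUM_GPU) from the quoted finetune.py line."""
    return int(target_gpus / num_gpu)


def build_update_freq_arg(num_gpu):
    """Build the same command-line override string as the quoted line."""
    return "+optimization.update_freq='[" + str(update_freq_for(num_gpu)) + "]'"


def simulated_gpus(num_gpu, target_gpus=24):
    """Effective GPU count after accumulation (assumption: num_gpu * update_freq)."""
    return num_gpu * update_freq_for(num_gpu, target_gpus)


if __name__ == "__main__":
    for n in (1, 2, 5, 8):
        print(n, build_update_freq_arg(n), simulated_gpus(n))
```

Note that int() truncates: with 5 GPUs the factor is 4, so the simulated count is 20 rather than 24.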

"Please ensure that the architectures match.".format(filename) Exception: Cannot load model parameters from checkpoint /content/self-supervised-speech-recognition/wav2vec_small_960h.pt; please ensure that the architectures match. Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace....

Describe your question: Seeking guidelines from the community for improving models for production: different accents, for example, US/UK/AU English; filler words, for example, um, ah; domain words, for example, company...

Hi mailong25, I built the source on Google Colab and everything runs fine, but when I test with data the response returned is ['']. I couldn't find...

Hey, how do you recommend we tune the parameters in the transcribe function: lm_weight, word_score, and beam_size? Normally, with things like Deepspeech 2, we use its logits to tune this, but...
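One common way to approach this kind of tuning is a grid search on a held-out dev set, scored by word error rate. The sketch below is generic and not the repository's method: `transcribe_fn` is a hypothetical callable that accepts lm_weight, word_score, and beam_size as keyword arguments, and its signature is an assumption.

```python
from itertools import product


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1] / max(len(ref), 1)


def grid_search(transcribe_fn, dev_set, lm_weights, word_scores, beam_sizes):
    """Return (best average WER, best parameters) over the full grid.

    dev_set is a list of (audio, reference_text) pairs; transcribe_fn is
    a hypothetical stand-in for the decoder call.
    """
    best = None
    for lm_weight, word_score, beam_size in product(lm_weights, word_scores, beam_sizes):
        params = {"lm_weight": lm_weight, "word_score": word_score, "beam_size": beam_size}
        wers = [word_error_rate(ref, transcribe_fn(audio, **params))
                for audio, ref in dev_set]
        avg = sum(wers) / len(wers)
        if best is None or avg < best[0]:
            best = (avg, params)
    return best
```

A coarse grid first, followed by a finer grid around the best point, usually keeps the number of decoding runs manageable.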

ERROR: Command errored out with exit status 1: /usr/bin/python3 -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/home/baoloc2509/Desktop/Project/Rever/Call_transcription/self-supervised-speech-recognition/libs/wav2letter/bindings/python/setup.py'"'"'; __file__='"'"'/home/baoloc2509/Desktop/Project/Rever/Call_transcription/self-supervised-speech-recognition/libs/wav2letter/bindings/python/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' develop --no-deps Check the logs for full...