Lain
Hey, our project wants to use Spleeter as the pre-processing module for speech recognition, but speech recognition requires streaming input. I don't know whether Spleeter...
Hello, I recently came across this model while working on VAD. At present, we use the voice audio separated by this model to perform VAD (using a neural network with LSTM). At present, I...
Hello, when using the 762 dataset, I extracted feat.scp with the default conf and then ran python3 local/extract_gop_feats.py, which failed with kaldi_io.kaldi_io.UnknownVectorHeader: The header contained 'CM '. Could you explain why this happens, and which parameters should be used to extract features with Kaldi (mfcc, fbank, etc.)? Below is the default parameter configuration for 762. --use-energy=false # use average of log energy, not energy. --num-mel-bins=40 # similar to Google's setup. --num-ceps=40 # there is no dimensionality reduction. --low-freq=20...
Hello, I am very happy that the UnitySentis demo has finally arrived. This surprised me. I am a speech recognition algorithm engineer. We often encounter problems like this: the inference...
Are there any plans to support this model? https://huggingface.co/kyutai/mimi