ASRT_SpeechRecognition icon indicating copy to clipboard operation
ASRT_SpeechRecognition copied to clipboard

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Results 112 ASRT_SpeechRecognition issues
Sort by recently updated
recently updated
newest added

您好, 想请教一下大佬,我在使用CTCLoss作为损失函数进行训练之后,预测的结果类似于:['n','i','blank','blank','blank','blank','blank','blank','blank','h','a','o'],这并不像尖峰表示法,是什么原因导致呢?

目前,只有语音转文字,但可以写个文字转语音的版本么?

您好,我在用您的模型测试时,速度特别慢。1660ti,显存占用率在10%以下。该怎么解决

我下载了最新的master代码,解压了数据,然后训练也可以跑起来: [*Info] Create Model Successful, Compiles Model Successful. [running] train epoch 0 . [message] epoch 0 . Have train datas 0+ Epoch 1/1 500/500 [==============================] - 7968s 16s/step - loss:...

你好,我用的是ASRT_v0.6.0的版本,想做asr转换测试 我用了自己的wav文件,格式的话和您推荐的是同样的格式 file output1111.wav output1111.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 16000 Hz 但是在用inference的时候报了如下的错误 could not broadcast input array from shape (6758,200,1) into shape (1600,200,1) 想问问是什么原因?要怎么样才可以修正呢?谢谢

如何设置或训练,使其只识别整数、小数和部分关键词,需要修改原数据集吗

你好,请问可以给我发一份ST-CMDS的数据集嘛,网速太慢,实在现在不下来,万分感谢

下载ASRT_SpeechRecognition-0.6.1.tar.gz里面没有训练好的模型参数,没有model_speech文件夹

如题,运行`test.py`后,会出现错误: ```bash Traceback (most recent call last): File "/Users/MaTianlai/Downloads/ASRT_v0.6.1/test.py", line 35, in r = ms.RecognizeSpeech_FromFile('/Users/MaTianlai/Downloads/ASRT_v0.6.1/resources/test.wav') File "/Users/MaTianlai/Downloads/ASRT_v0.6.1/SpeechModel251.py", line 380, in RecognizeSpeech_FromFile r = self.RecognizeSpeech(wavsignal, fs) File "/Users/MaTianlai/Downloads/ASRT_v0.6.1/SpeechModel251.py", line 365, in...