PaddleSpeech issues

Results 289 PaddleSpeech issues

Sort by recently updated

[TTS] Python API experience：huggingface Runtime error

![image](https://github.com/PaddlePaddle/PaddleSpeech/assets/50574730/491834a2-7126-409a-8bb3-e1f78f36f9fd)

jaysonteng

Bug

T2S

目前官方TTS流式语音合成支持中英文混合识别吗？

目前官方TTS流式语音合成支持中英文混合识别吗？测试发现只单独支持中文或者英文

AI-Mart

Question

调用 asr = ASRExecutor()gpu一直增加

调用 asr = ASRExecutor() current_result = asr(audio_file=audio_path_output, lang="en", model="transformer_librispeech")后gpu内存没有被释放掉，每调用一次，gpu内存都会增加，最后导致gpu崩溃[S2T]XXXX

hjj-lmx

Bug

S2T

【有手就行】使用你自己的声音做语音合成-这项目有跑起来的吗?

## General Question 【有手就行】使用你自己的声音做语音合成-这项目有跑起来的吗? https://aistudio.baidu.com/projectdetail/5003396 感觉改了一大堆东西什么用都没有,各种环境报错.无力吐槽

fantasysea

Question

在vits模型文件下的wavenet 文件中的residual_block.py文件中的第127行的下面代码中， x = F.dropout(x, p=self.dropout_rate, training=self.training) 参数training使用的self.training,没有在该py文件的任何地方定义，导致报错。 ![image](https://github.com/PaddlePaddle/PaddleSpeech/assets/108966996/39a02136-c8cb-4796-aa3d-e2ec8eca1cf2)

JoyceMind

Bug

T2S

请问为什么使用流式语音识别服务的时候，识别速度越来越慢？

比如我输入50s的语音，在40s之后显然感觉到识别结果刷新更慢了。

gooloosk

Question

tts python服务怎样设置cpu_threads

rt, 在application.yaml中添加了cpu_threads: 但是不起作用

kli017

Question

官方的例子不能成功运行

《【PaddleSpeech】一键预测，快速上手Speech开发任务》这个官方例子使用aistudio都不能成功运行。

klzhong69

流式ASR，支持8kHz的wav音频吗？

我解析出来结果不正确，我用的conformer_online_wenetspeech模型，把原来是16000改成了8000，以下是我的配置文件： `asr_online: model_type: 'conformer_online_wenetspeech' am_model: # the pdmodel file of am static model [optional] am_params: # the pdiparams file of am static model [optional] lang: 'zh' sample_rate: 8000 cfg_path: decode_method:...

z070204z

Question

[S2T]流式ASR麦克风识别---中英文---['asr_online']---需求

成功运行了 [PaddleSpeech-Streaming-ASR-Client](https://github.com/xiaomingnio/PaddleSpeech-Streaming-ASR-Client/tree/db6787e5da75e3948c9120b8af6659a71ec8146f) /websocket_client.py 发现并不支持`pretrained_models.py`中的`conformer_talcs-codeswitch_zh_en-16k`模型， `Can't find "conformer_talcs-codeswitch-zh_en-16k" in resource. Model name must be one of ['conformer_online_wenetspeech-zh-16k', 'conformer_u2pp_online_wenetspeech-zh-16k', 'conformer_online_multicn-zh-16k', 'conformer_online_aishell-zh-16k', 'deepspeech2online_wenetspeech-zh-16k', 'deepspeech2online_aishell-zh-16k']` 找到了`conformer_talcs_application.yaml`中发现`protocol`并不支持`websocket`，`engine_list`也不支持`asr_online`。有大佬知道在哪儿修改吗？

ScottXiao233

Bug

S2T

PaddleSpeech
PaddleSpeech copied to clipboard

Metadata

[TTS] Python API experience：huggingface Runtime error

目前官方TTS流式语音合成支持中英文混合识别吗？

调用 asr = ASRExecutor()gpu一直增加

【有手就行】使用你自己的声音做语音合成-这项目有跑起来的吗?

[TTS]没有设置参数但使用了该参数的值

请问为什么使用流式语音识别服务的时候，识别速度越来越慢？

tts python服务怎样设置cpu_threads

官方的例子不能成功运行

流式ASR，支持8kHz的wav音频吗？

[S2T]流式ASR麦克风识别---中英文---['asr_online']---需求

← Metadata

Owner

Metadata

PaddleSpeech PaddleSpeech copied to clipboard

Metadata

← Metadata

Owner

Metadata

PaddleSpeech
PaddleSpeech copied to clipboard