PaddleSpeech
PaddleSpeech copied to clipboard
处理使用 PaddleSpeech 过程中出现的报错 ValueError (InvalidArgument) Broadcast dimension mismatch
运行在百度 AiStudio
,相关依赖包:
paddle-bfloat 0.1.7
paddle2onnx 1.1.0
paddleaudio 1.1.0
paddlefsl 1.1.0
paddlenlp 2.5.2
paddlepaddle 2.4.2
paddlesde 0.2.5
paddleslim 2.6.0
paddlespeech 1.4.1
paddlespeech-ctcdecoders 0.2.1
paddlespeech-feat 0.1.0
ppdiffusers 0.19.4
Python 3.8.18
问题描述:
使用 语音识别 和 视频字幕 生成的 Python API 时出现报错。
出现以下报错:
ValueError: (InvalidArgument) Broadcast dimension mismatch. Operands could not be broadcast together with the shape of X = [1, 1, 0, 498] and the shape of Y = [1, 123, 123]. Received [498] in X is not equal to [123] in Y at i:3.
[Hint: Expected x_dims_array[i] == y_dims_array[i] || x_dims_array[i] <= 1 || y_dims_array[i] <= 1 == true, but received x_dims_array[i] == y_dims_array[i] || x_dims_array[i] <= 1 || y_dims_array[i] <= 1:0 != true:1.] (at /paddle/paddle/phi/kernels/funcs/common_shape.h:84)
Traceback (most recent call last):
File "test1.py", line 3, in <module>
result = asr(audio_file="zh.wav")
File "/home/aistudio/PaddleSpeech/paddlespeech/cli/utils.py", line 328, in _warpper
return executor_func(self, *args, **kwargs)
File "/home/aistudio/PaddleSpeech/paddlespeech/cli/asr/infer.py", line 512, in __call__
res = self.postprocess() # Retrieve result of asr.
File "/home/aistudio/PaddleSpeech/paddlespeech/cli/asr/infer.py", line 335, in postprocess
return self._outputs["result"]
KeyError: 'result'
参考 语言识别 功能相关文档时发现其中的 Python API 运行时不报错。 参考链接:https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/demos/speech_recognition/README_cn.md
经测试发现 asr_executor()
方法时还需要指定模型。
例:
from paddlespeech.cli.asr.infer import ASRExecutor
asr = ASRExecutor()
result = asr(audio_file="zh.wav",model='conformer_wenetspeech')
print(result)
运行结果:
写了两篇博客记录了相关过程,希望可以帮助到各位:
- 在百度 AiStudio 平台中使用 PaddleSpeech:https://blog.csdn.net/qq_45897239/article/details/136572991
- 处理使用 PaddleSpeech 过程中出现的报错 ValueError (InvalidArgument) Broadcast dimension mismatch:https://blog.csdn.net/qq_45897239/article/details/136573149