Yuekai Zhang
> @yuekaizhang
>
> Well, the plan is:
>
> * modify `WhisperEncoder` to have the same signature as the regular `EncoderModel`
> * use the `prompt_embedding_table` input to pass the actual fbanks...
> ### System Info
> Just a simple Python bug; system agnostic.
>
> ### Who can help?
> @byshiue
>
> ### Information
> * [x] The official example...
> May I ask, does the C++ version behave the same way?

The GPU is affected; the CPU is not. The C++ and Python versions behave the same.
@willnufe Some modifications to the code are needed to support it. I don't have time recently, but if you are willing to do it, I can...
@willnufe I think to get the max throughput, we first need to make the ONNX FP16 Paraformer work. https://github.com/modelscope/FunASR/commit/9a9b474e7de7cc90d2ee124dc8d6c2cfa887c059. This commit used several registered hooks to rescale the TorchScript FP32 model to...
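The hook mechanism behind that rescaling can be sketched with PyTorch forward hooks. This is a minimal illustration of the technique only: the scale factor, the choice of `nn.Linear` modules, and the lack of downstream compensation are assumptions here, not what the FunASR commit actually does.

```python
import torch
import torch.nn as nn

def add_rescale_hooks(model: nn.Module, scale: float = 0.5):
    # Illustrative sketch: register forward hooks on Linear layers that
    # scale activations down so they stay inside FP16's dynamic range.
    # The real commit decides which modules to rescale and compensates
    # elsewhere so the final outputs are unchanged; this only shows the
    # hook mechanism itself.
    handles = []
    for module in model.modules():
        if isinstance(module, nn.Linear):
            handles.append(
                module.register_forward_hook(lambda mod, inp, out: out * scale)
            )
    return handles

model = nn.Sequential(nn.Linear(4, 4), nn.ReLU(), nn.Linear(4, 4))
handles = add_rescale_hooks(model)
y = model(torch.ones(1, 4))
```

Returning a value from a forward hook replaces the module's output, which is why this pattern can retrofit rescaling onto a model without editing its `forward`.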
For distil-whisper, would you mind adding `model = model.half()` here https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/whisper/distil_whisper/convert_from_distil_whisper.py#L60 for now? The code fix will be synced to GitHub later. Thanks.
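The suggested one-liner just casts the loaded model's parameters and buffers to FP16 before export. A minimal sketch, using a stand-in module instead of the real distil-whisper checkpoint that the conversion script loads:

```python
import torch

# Stand-in for the Hugging Face model loaded by
# convert_from_distil_whisper.py; the real script works on a
# distil-whisper checkpoint, not a bare Linear layer.
model = torch.nn.Linear(8, 8)

# The proposed fix: cast all parameters and buffers to float16
# so the exported weights match the FP16 engine build.
model = model.half()
```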
@tianchengcheng-cn, please use the trt-llm version pinned in https://github.com/k2-fsa/sherpa/blob/master/triton/whisper/Dockerfile.server#L6, or use the latest trt-llm directly via docker-compose. You can refer to this code https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/whisper/run.py and modify it yourself, or wait for my update.
> https://github.com/k2-fsa/sherpa/tree/master/triton/scripts

I have checked the scripts here, but only the Conformer TRT script (triton/scripts/build_librispeech_pruned_transducer_stateless3_offline_trt.sh) has been released. Is it also OK for Zipformer to do export-onnx -> trtexec to get a TensorRT engine? @Vergissmeinicht...
@Vergissmeinicht Just commenting out these lines should be okay: https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/zipformer/zipformer.py#L1422-L1427.
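The export-onnx -> trtexec route discussed above would look roughly like this. The script path and flags are illustrative assumptions based on the icefall zipformer recipe layout, not tested commands:

```shell
# After commenting out the guard lines in zipformer.py, export the
# model to ONNX using the recipe's export script (path is assumed).
python3 zipformer/export-onnx.py \
  --exp-dir zipformer/exp \
  --tokens data/lang_bpe_500/tokens.txt

# Then build a TensorRT engine from the exported encoder with trtexec.
trtexec --onnx=zipformer/exp/encoder.onnx \
        --saveEngine=zipformer/exp/encoder.plan \
        --fp16
```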
> > @Vergissmeinicht Just commenting out these lines should be okay: https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/zipformer/zipformer.py#L1422-L1427.
>
> It works for me. But when I try using trtexec to convert the Zipformer ONNX model from...