uni-saurabh-vyas
uni-saurabh-vyas
found some information at https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/onnx_model_wrapper.py
Hi @csukuangfj @yuekaizhang Here are some notes based on my understanding: - These _cache are actually implicit states defined in Nvidia Triton, that are used internally (https://github.com/triton-inference-server/server/blob/main/docs/user_guide/architecture.md#implicit-state-management) - The following...
I started working on it, but I am a bit confused about 1 thing. In https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/export.py#L291 I see you already have onnx script for streaming zipformer right? Also according to...
I created a PR at https://github.com/k2-fsa/sherpa/pull/430
Is there any recipe in sherpa triton for zipformer ctc streaming ?
Yes, I think onnx recipe for sherpa with triton for ctc zipformer would be nice to have.
getting same issue