icefall icon indicating copy to clipboard operation
icefall copied to clipboard

FYI: RTF for the latest Zipformer transducer

Open csukuangfj opened this issue 2 years ago • 4 comments

We just created a colab notebook to show the RTF (Real-time factor) of the latest zipformer transducer model.

We use sherpa-onnx for the CPU test. As for the GPU test, we infer RTF from the decoding logs for the librispeech test-clean and test-other dataset.

The results are summarized below: Screenshot 2023-08-18 at 13 01 45

Please see the following colab notebook for details.

https://github.com/k2-fsa/colab/blob/master/rtf_test_for_zipformer_transducer.ipynb

csukuangfj avatar Aug 18 '23 05:08 csukuangfj

can I ask you which CPU exactly?

armusc avatar Aug 18 '23 20:08 armusc

can I ask you which CPU exactly?

It is the cpu provided by colab. Please see the colab notebook for details. We have provided the output of lscpu in the colab notebook.

csukuangfj avatar Aug 19 '23 15:08 csukuangfj

thanks

armusc avatar Aug 19 '23 15:08 armusc

@csukuangfj have you also looked at GPU streaming mode with sherpa-onnx? I'm curious if that might also be impacted by the streaming strategy, e.g., if you'd expect a fairly large difference in speed between a.) streaming a bunch of short utterances in sequence (and thus not really fully occupying the GPU) vs b.) streaming long recordings or many short recordings in parallel.

AdolfVonKleist avatar Aug 21 '23 13:08 AdolfVonKleist