sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V,...
Following: https://github.com/k2-fsa/icefall/issues/1119
#151 adds CoreML support for macOS. We also need to support iOS. The pre-built libs can be downloaded from https://onnxruntimepackages.z14.web.core.windows.net/pod-archive-onnxruntime-c-1.14.0.zip
After unzipping, you will find the following files: ``` ....
The offline flow has no warning. However, when I ran the streaming flow, I got the following warnings: ``` 2023-05-25 00:21:23.008301845 [W:onnxruntime:, session_state.cc:1136 VerifyEachNodeIsAssignedToAnEp] Some nodes were not assigned to...
Hello, are there any pre-trained models like icefall-asr-multidataset-pruned_transducer_stateless7-2023-05-04 (English), trained on GigaSpeech + LibriSpeech + Common Voice 13.0, for streaming ASR? Concurrently, which is the best pre-trained model...
Modifying the conditions of compilers on the x86 platform.
Hi everyone, do we have a shallow fusion implementation with an N-gram language model (e.g., KenLM or SRILM) instead of a neural language model? If not, can you give me some...
It would be nice if someone could help integrate https://github.com/google/sanitizers/wiki/AddressSanitizer into sherpa-onnx. It should help detect memory leak issues.
I changed `GetSessionOptionsImpl()` in `session.cc` to use OpenVINO.

```cpp
Ort::SessionOptions sess_opts;
sess_opts.SetIntraOpNumThreads(num_threads);
sess_opts.SetInterOpNumThreads(num_threads);
sess_opts.SetGraphOptimizationLevel(ORT_DISABLE_ALL);
OrtOpenVINOProviderOptions options;
options.device_type = "CPU_FP32";
sess_opts.AppendExecutionProvider_OpenVINO(options);
return sess_opts;
```

After that, I ran into errors like...
Note: Passing `--provider=directml` on the command line does not seem to improve the speed.
I've seen that for Icefall, the 2 ways to export models are using either ONNX (this package) or NCNN. Has there been any benchmarking done for the 2 methods? I'm...