DakeQQ comments

Results 18 comments of


                                            DakeQQ

希望能增加对FunASR模型的支持

1. 欢迎使用[本仓库](https://github.com/DakeQQ/Transcribe-and-Translate-Subtitles)来离线转录和翻译视频 (支持批量)，确保最高的隐私保护。 2. 翻译功能基于大语言模型（LLM），如 Qwen2.5-7B、Qwen2.5-14B、GLM4-9B 等。 3. 该工具对于CPU用户友好，16GB 内存的电脑即可运行LLM-7B。 4. 转录功能依托于多种强大的模型，包括 SenseVoiceSmall、Paraformer、Whisper V2 / V3 / Turbo，以及针对日语的微调 Whisper 模型。 5. 仓库包含 SenseVoiceSmall、Paraformer、FSMN-VAD、Denoiser-ZipEnhancer 和 Denoiser-DFSMN，这些都是阿里巴巴家族的模型，在中文任务中表现优异。

Any plans to support exporting onnx and TensorRT engine?

Feel free to use this [repo](https://github.com/DakeQQ/STFT-ISTFT-ONNX) to export your custom STFT or ISTFT process to ONNX format.

Issues Related to TensorRT Accelerated Inference

Feel free to use this [repo](https://github.com/DakeQQ/STFT-ISTFT-ONNX) to export your custom STFT or ISTFT process to ONNX format. There’s no need to separate the STFT and ISTFT from the model anymore.

ONNX模型转换报错，模型无法使用

Feel free to reference this [repo](https://github.com/DakeQQ/Automatic-Speech-Recognition-ASR-ONNX). It is an end-to-end version that includes the STFT process. Simply provide the audio input to obtain the ASR result. You can also customize...

sensevoice-onnx模型每次识别新的（之前没有见过的）输入语音都要加载10几秒，很影响推理效率，这个问题如何解决？

请问是否可以提供转成onnx的相关指导文档，谢谢

欢迎参考这份[Python脚本](https://github.com/DakeQQ/Native-LLM-for-Android/blob/main/Export_ONNX/MiniCPM/MiniCPM_Export.py)来导出MiniCPM4-ONNX

希望大大出个yolo11的cpp例子

[欢迎参考ONNX Runtime框架的YOLO :)](https://github.com/DakeQQ/YOLO-Depth-Estimation-for-Android)

Abnormal of onnx model to trt model in the inference results

Welcome to refer to this [export script](https://github.com/DakeQQ/Text-to-Speech-TTS-ONNX/blob/main/BigVGAN/Export_BigVGAN.py) to export BigVGAN to ONNX format.

[about ONNX export]

MPS Backend Not Working – CPU is Slow on Apple Silicon

@theseriouspdx Feel free to explore this [repository](https://github.com/DakeQQ/Transcribe-and-Translate-Subtitles), designed specifically for CPU-only users. With an Intel i3-12300 CPU (4 threads), it can transcribe a 2-hour movie in approximately 20 minutes using...