JaonLiu
How can this be converted into a model that TF2 can use directly? Any help would be appreciated~
@fxmarty the log:

```
Validating ONNX model /share_model_zoo/LLM/openai/onnx_whisper-large-v3/encoder_model.onnx...
	-[✓] ONNX model output names match reference model (last_hidden_state)
	- Validating ONNX Model output "last_hidden_state":
		-[✓] (2, 1500, 1280) matches (2, 1500,...
```
> @mmingo848 You can use:
>
> ```shell
> optimum-cli export onnx --help
> optimum-cli export onnx --model openai/whisper-large-v3 whisper_onnx
> ```
>
> and then use [ORTModelForSpeechSeq2Seq](https://huggingface.co/docs/optimum/main/en/onnxruntime/package_reference/modeling_ort#optimum.onnxruntime.ORTModelForSpeechSeq2Seq).
>
> ...
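For anyone landing here, a minimal sketch of loading the exported directory with `ORTModelForSpeechSeq2Seq` and transcribing with it. The output directory name and the dummy audio are assumptions, not taken from the export above:

```python
# Minimal sketch: load the ONNX export produced by optimum-cli and transcribe.
# `whisper_onnx` is the export directory from the command above (an assumption);
# if the processor files were not saved there, load them from openai/whisper-large-v3.
import numpy as np
from transformers import AutoProcessor
from optimum.onnxruntime import ORTModelForSpeechSeq2Seq

model_dir = "whisper_onnx"
processor = AutoProcessor.from_pretrained(model_dir)
model = ORTModelForSpeechSeq2Seq.from_pretrained(model_dir)

# One second of silence at 16 kHz as a stand-in for real audio.
audio = np.zeros(16000, dtype=np.float32)
inputs = processor(audio, sampling_rate=16000, return_tensors="pt")

generated_ids = model.generate(inputs["input_features"])
print(processor.batch_decode(generated_ids, skip_special_tokens=True))
```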
> @MrRace You need `--task automatic-speech-recognition-with-past`. There should be a log during the export about it (that specifying `--task automatic-speech-recognition` disables the KV cache).

@fxmarty Thank you very much for your...
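For completeness, a sketch of re-running the export with that task, invoking the same CLI from Python via subprocess (the model name comes from the command quoted earlier; the output directory name is my own choice):

```python
# Sketch: re-export whisper-large-v3 with the KV cache kept, by passing the
# automatic-speech-recognition-with-past task to optimum-cli.
import subprocess

subprocess.run(
    [
        "optimum-cli", "export", "onnx",
        "--model", "openai/whisper-large-v3",
        "--task", "automatic-speech-recognition-with-past",
        "whisper_onnx",  # output directory (an assumption)
    ],
    check=True,
)
```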
> Yes, this was fixed in #1780, which is not yet in a release.
>
> Please downgrade to onnx 1.15 or use optimum from source.

@fxmarty Thanks a lot,...
> Hi @MrRace, if you don't want to reimplement the inference code from scratch, I advise you to use https://huggingface.co/docs/optimum/main/en/onnxruntime/package_reference/modeling_ort#optimum.onnxruntime.ORTModelForSpeechSeq2Seq. An example is available there. By default, only `encoder_model.onnx` and...
> mlc_config.json

@Mawriyo Thanks for your reply. Here is the content of mlc_config.json:

```json
{
  "model_type": "llama",
  "quantization": "q4f16_1",
  "model_config": {
    "hidden_size": 4096,
    "intermediate_size": 11008,
    "num_attention_heads": 32,
    "num_hidden_layers": 32,
    "rms_norm_eps":...
```
@Hzfengsy Which version of Qwen1.5 are you specifically using? Qwen1.5-0.5B-Chat? Or Qwen1.5-1.8B-Chat? Or Qwen1.5-4B-Chat?
same request here!