JaonLiu comments

Results 72 comments of


                                            JaonLiu

[Model Request] please support Qwen-VL model

wish +10086

[Model Request] please support Qwen-VL model

> Thanks for your request! We will try to support that soon. Any good news?

Process is going to kill itself!

Same problem！when use Qwen2！But for Qwen1.5, it can work！

GRPO训练报错：Fatal Python error: none_dealloc: deallocating None: bug likely caused by a refcount error in a C extension

same error when use GRPO

SenseVoice可以部署到手机本地运行吗？

@csukuangfj Can you please share a detailed tutorial? The tutorial on https://k2-fsa.github.io/sherpa/onnx/sense-voice/export.html#the-code is not very detailed.

[Question] Speculative Decoding Mode

@sunzj does the Speculative Decoding Mode can been used in Android ?

TypeError: ModernBertForSequenceClassification.init() got an unexpected keyword argument 'compile'

same error

[Feature Request] Lookahead Decoding support

> Likely the self speculating models like eagle would help in this case @tqchen How to use the Eagle inference acceleration on an Android phone with MLC-LLM? Thanks a lot!

[Feature Request] Lookahead Decoding support

> Likely the self speculating models like eagle would help in this case @tqchen when use eagle, it seems need to train a draft model~

[Bug] how to accurately measure the real memory usage on Android ？

> a few hundred MB must be the CPU memory usage. However, the model is stored in the GPU memory, so OS-level memory command is not enough @Hzfengsy Is there...