王召德 comments

Results: 98 comments of 王召德

Service

Do you mean deploying it on the server side and then calling it via requests?

Missing a file for the llama3-8b-instruct-mnn model

No file is missing; the process info is just somewhat inaccurate.

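Modified the Llama2_7b parameters and exported an MNN model; on an Android device [rest of title garbled in the original]

Check whether the model files are complete.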

Error compiling the LLM engine libllm and llm_demo on the MNN release version

Could it be that the compiler does not support C++17?

How to get LLM model performance?

I modified the original demo to display the `prefill` and `decode` speeds. The code is here: https://github.com/wangzhaode/mediapipe-llm-demo/blob/main/android/app/src/main/java/com/google/mediapipe/examples/llminference/InferenceModel.kt
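For readers who want to see what the two numbers mean, here is a minimal sketch of how `prefill` and `decode` speeds are commonly derived from timestamps. It is not the code from the linked Kotlin file; the function name, the `SpeedReport` type, and the choice to count only tokens produced after the first one as decode tokens are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class SpeedReport:
    prefill_tokens_per_s: float
    decode_tokens_per_s: float


def measure_speeds(prompt_tokens: int, decoded_tokens: int,
                   start_s: float, first_token_s: float, end_s: float) -> SpeedReport:
    """Compute prefill and decode throughput from three timestamps (in seconds).

    Assumptions (hypothetical, for illustration only):
    - prefill covers the time from the request start to the first generated token;
    - decoded_tokens counts the tokens generated after the first one.
    """
    prefill_time = first_token_s - start_s
    decode_time = end_s - first_token_s
    if prefill_time <= 0 or decode_time <= 0:
        raise ValueError("timestamps must be strictly increasing")
    return SpeedReport(
        prefill_tokens_per_s=prompt_tokens / prefill_time,
        decode_tokens_per_s=decoded_tokens / decode_time,
    )
```

In a real app the timestamps would come from a monotonic clock sampled when the request is sent, when the first partial result arrives, and when generation finishes.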

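[Bug] Regex extraction of mmlu_pro results fails

```
def first_option_postprocess(text: str, options: str, cushion=True) -> str:
    text = text.replace("Answer: Let's think step by step.", '')
    ....
```

After making this change, result extraction appears to work correctly, but the change may not be entirely appropriate, so I have not submitted a PR. Could you please check whether there is a better way to fix it?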
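The idea in the comment above (strip the echoed chain-of-thought prefix before running the regex) can be sketched in isolation. This is a simplified, hypothetical extractor, not OpenCompass's `first_option_postprocess`; the name `extract_option` and both patterns are assumptions chosen only to illustrate the approach, and `options` is assumed to contain plain letters.

```python
import re

# The prompt suffix that the model output may echo back.
COT_PREFIX = "Answer: Let's think step by step."


def extract_option(text: str, options: str) -> str:
    """Return the chosen option letter, or '' if none is found."""
    # Remove the echoed prefix so it cannot interfere with pattern matching.
    text = text.replace(COT_PREFIX, '')
    # Prefer an explicit statement such as "the answer is (C)".
    match = re.search(rf'answer is \(?([{options}])\)?', text, flags=re.IGNORECASE)
    if match:
        return match.group(1).upper()
    # Otherwise fall back to the first standalone option letter.
    match = re.search(rf'\b([{options}])\b', text)
    return match.group(1) if match else ''
```

For example, `extract_option("Answer: Let's think step by step. So the answer is (C).", "ABCDEFGHIJ")` yields `'C'`. Whether stripping the prefix belongs in the post-processor or in the dataset prompt is the open design question raised in the comment.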

[Bug] Regex extraction of mmlu_pro results fails

I do have some local changes, but the issue appears in the results when evaluating `mmlu_pro` with `Qwen2.5-VL-Instruct`; I also tried `Qwen2.5-0.5B-Instruct` and it has the same problem. To reproduce, you can use this command:

```
python run.py eval.py
```

The contents of `eval.py` are as follows:

```python
from mmengine.config import read_base

with read_base():
    from opencompass.configs.datasets.mmlu_pro.mmlu_pro_gen_cdbebf import mmlu_pro_datasets

datasets = [*mmlu_pro_datasets]

from opencompass.models import HuggingFacewithChatTemplate

model_path = '/path/to/Qwen2.5-0.5B-Instruct'
models =...
```

Modifying the model input size

Is there no example of converting a cv::mat into an MNN input?

Integrate kleidiAI release v0.1.0 into MNN 2.9.3