yang.tian
I hit the same case on my side. @franklei01 Do you mean that setting the environment variable below to 1 before running llama-cli makes the Vulkan output correct? `GGML_VK_DISABLE_COOPMAT`
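For reference, toggling that variable for a single run would look something like the sketch below. The model filename is a placeholder; `GGML_VK_DISABLE_COOPMAT=1` tells llama.cpp's Vulkan backend to skip the cooperative-matrix code path.

```shell
# Disable the Vulkan cooperative-matrix (coopmat) path for this run only.
# Model path is a placeholder - substitute your own GGUF file.
GGML_VK_DISABLE_COOPMAT=1 ./llama-cli -m Qwen2.5-3B-Instruct-Q4_0.gguf -p "hello"
```

Setting it inline (rather than with `export`) keeps the override scoped to that one invocation, which makes it easy to A/B the coopmat and non-coopmat paths.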
> > I hit the same case on my side.
> >
> > [@franklei01](https://github.com/franklei01) Do you mean that setting the environment variable below to 1 before running llama-cli makes the Vulkan output correct? `GGML_VK_DISABLE_COOPMAT`...
```
root@localhost:~/build-vulkan/bin# taskset -c 0,5,6,7,8,9,10,11 ./llama-bench -m Qwen2.5-3B-Instruct-Q4_0.gguf -pg 128,128 -t 8
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = Mali-G720-Immortalis (Mali-G720-Immortalis) | uma: 1 | fp16: 1 | warp size:...
```
I hit the same issue building armnn 22.11.01 with gcc 12.3.
> If the tmp file is created, is subsequent inference correct? It looks like nothing is cached on any run.

```
Update cache to tmp/mnn_cachefile.bin, size = 3015836
Open tmp/mnn_cachefile.bin error
Write Cache File error!
```

Does the presence of this file have a significant impact on performance? In my tests I see a large prefill difference on Vulkan.
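One plausible cause of the `Open tmp/mnn_cachefile.bin error` in the log above is simply that the relative `tmp/` directory does not exist in the working directory, so the cache file can never be written and the kernels are recompiled on every run. This is an assumption from the log, not confirmed MNN behavior; a minimal pre-flight check would be:

```shell
# Create the cache directory if missing, so tmp/mnn_cachefile.bin can be
# written on the first run and reused on subsequent runs.
mkdir -p tmp
[ -d tmp ] && echo "cache dir ready"   # prints "cache dir ready"
```

If the cache file then appears and persists across runs, comparing prefill speed with and without it would confirm whether the missing cache explains the performance gap.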