lhez
Results
2
issues of
lhez
Properly identify mark multi rope and vision rope and mark them as unsupported so that these rope variants get put back to CPU and does not crash. Also `fp16` variant...
ggml
Currently small models like qwen2.5 0.5B does not work properly with OpenCL backend. This PR fixes this issue. This PR also changes subgroup size to 64 for all Adreno GPUs.
ggml