ggml ggml vs onnxruntime on SOC chip

ggml vs onnxruntime on SOC chip

Open Francis235 opened this issue 9 months ago • 1 comments

Hello, I would like to know if anyone has compared the inference of ggml and onnxruntime on SOC in terms of latency, memory usage, %CPU and other indicators? For example, CPU/GPU backend.

Apr 30 '24 02:04 Francis235

I am interested in knowing too

Jun 06 '24 20:06 bkaruman

ggml ggml copied to clipboard

ggml vs onnxruntime on SOC chip

ggml
ggml copied to clipboard