Eric Curtin

Results 479 comments of Eric Curtin

@cgwalters I am trying to use skopeo, but if that doesn't work out, maybe bootc docker can interact directly. WDYT?

We may want to bypass skopeo docker-daemon anyway, performance seems shocking, takes a long time and eats up CPU hard...

Maybe you need this fix? https://github.com/containers/ramalama/pull/802

It's `--ngl 999` rather than `--ngl=999`

You need to put it after the run command I think

Can you try updating the version of ramalama, this ngl thing was added recently enough

Could you paste the full "--debug" output? Also what are you using to check if the GPU is being utilised? nvtop? 5 seconds reasonable to initialized a GPU. Might be...

@Split7fire seems like llama-run/llama-bench is crashing then, you'll need to debug this in the llama.cpp layer

First attempt, GPU passthrough isn't working (I tried with AMD GPU) and there's some weird issue where ">" displays as "e", could be because of the emoji or TERM or...