cortex.cpp

Discussion: How to tell if Nitro is using a GPU?

Open dpkirchner opened this issue 1 year ago • 4 comments

Motivation

I'm tinkering with Nitro for the first time. I've successfully loaded a small model; however, responses from the completions route are very slow (I'm minutes in and have yet to receive a response).

Discussion

I don't know if Nitro is actually using the GPU; it's definitely using all of my CPU, so I'm guessing not. Is there some string that I can look for in stdout or stderr? For what it's worth, I've verified that other, non-Nitro software run in the same fashion does indeed use the GPU.

Resources

I'm using the example commands here: https://github.com/janhq/nitro/blob/d0cc50b893609d66006cf8d2b24f0568cd687ad5/README.md

dpkirchner avatar Jan 10 '24 19:01 dpkirchner

If you set "ngl" to a value in the 0-3x range and are using a binary distribution built with CUDA, it is using the GPU. You can also tell from the log when the model is loaded.
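As a rough illustration of checking the log, here is a minimal Python sketch that scans captured stdout/stderr for a layer-offload message. It assumes Nitro passes through llama.cpp's loader output and that the line looks like `llm_load_tensors: offloaded N/M layers to GPU`; that wording is an assumption and may differ between versions, and `gpu_layers_from_log` is a hypothetical helper name.

```python
import re

# Assumed wording of the loader message; adjust the pattern if your build
# prints something different.
OFFLOAD_RE = re.compile(r"offloaded (\d+)/(\d+) layers to GPU")


def gpu_layers_from_log(log_text):
    """Return (offloaded, total) from the first matching line, or None."""
    match = OFFLOAD_RE.search(log_text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


# Example with a made-up log line in the assumed format.
sample = "llm_load_tensors: offloaded 32/33 layers to GPU\n"
print(gpu_layers_from_log(sample))
```

If the function returns `None`, or the first number is 0, the model is most likely running on the CPU only.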

tikikun avatar Jan 11 '24 01:01 tikikun

There seems to be a problem with the install script since version 0.2.5: it uses an incorrect download URL. There are now 2 different CUDA releases:

But the download URL leaves off the -11-7 or -12-0 and only appends -cuda. We were able to work around this by manually updating install.sh to include the -12-0 suffix. Either the script needs to be updated to allow for more granular selection, or one of the CUDA releases needs to be assigned as the default for the script to work.
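To make the "more granular selection" idea concrete, here is a minimal Python sketch of the selection logic only. It is not how install.sh works; `cuda_release_suffix` and `CUDA_SUFFIXES` are hypothetical names, and mapping by CUDA major version is an assumption. Only the two suffixes come from the release names mentioned above.

```python
# Suffixes taken from the two CUDA releases mentioned in this thread.
CUDA_SUFFIXES = {11: "-cuda-11-7", 12: "-cuda-12-0"}


def cuda_release_suffix(cuda_version):
    """Map an installed CUDA version such as '12.2' to a release suffix.

    Assumption: a build targets any installed toolkit with the same
    major version.
    """
    major = int(cuda_version.split(".")[0])
    if major not in CUDA_SUFFIXES:
        raise ValueError(f"no CUDA release for version {cuda_version}")
    return CUDA_SUFFIXES[major]


print(cuda_release_suffix("12.2"))
```

An installer could append the returned suffix instead of the bare -cuda, and fall back to a CPU build when a `ValueError` is raised.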

Hoekz avatar Jan 11 '24 18:01 Hoekz

I'd recommend not using the install script since it hasn't been updated recently; using the plain binary file will give you the best experience.

tikikun avatar Jan 12 '24 01:01 tikikun

@hiento09 can also have a look at the install script

tikikun avatar Jan 12 '24 04:01 tikikun