gpt4all [Feature] GPU support in docker-based API server

[Feature] GPU support in docker-based API server

Open Matszwe02 opened this issue 1 year ago • 4 comments

trafficstars

System Info

GTX1060, Win10, GPT4All Falcon

Information

[ ] The official example notebooks/scripts
[ ] My own modified scripts

Reproduction

run GPT4All
Use GPT4All built in chat
Use GPT4All API
see that built in chat usesGPU (as selected in settings), API only uses CPU

Expected behavior

Use the same source with API that built-in chat uses

Feb 03 '24 15:02 Matszwe02

Getting inspiration from the Python module, I simply added "device": "gpu" to the JSON-HTTP call performed by CURL and gpt4all is using the GPU!

Full example:

url http://localhost:4891/v1/chat/completions -H "Content-Type: application/json"  -d '{ "model": "Nous Hermes 2 Mistral DPO","messages": [{"role": "user", "content": "help me"}], "temperature": 0.7,"device":"gpu" }'

So this ticket is more about documentation than a missing feature.

Mar 13 '24 18:03 scubbx

Getting inspiration from the Python module, I simply added "device": "gpu" to the JSON-HTTP call performed by CURL and gpt4all is using the GPU!

You're using the docker-based gpt4all-api server? I doubt that actually works: https://github.com/nomic-ai/gpt4all/blob/97de30edd100071ef70372f38e85959cac6378a3/gpt4all-api/gpt4all_api/app/api_v1/routes/chat.py#L50-L53

And then it proceeds to call GPT4All without ever passing the device argument: https://github.com/nomic-ai/gpt4all/blob/97de30edd100071ef70372f38e85959cac6378a3/gpt4all-api/gpt4all_api/app/api_v1/routes/chat.py#L62

Mar 14 '24 01:03 cebtenzzre

No, I'm using the Ubuntu installer. When testing the behaviour empirically, it works:

Mar 14 '24 01:03 scubbx

No, I'm using the Ubuntu installer. When testing the behaviour empirically, it works:

The OP reports that GPT4All Chat's built-in local server uses the GPU when one is selected in settings. Their request is to also add GPU support to the standalone docker-based API server. Your findings seem to agree with theirs.

Mar 14 '24 01:03 cebtenzzre

gpt4all-api has been removed, see #2314.

May 09 '24 22:05 cebtenzzre

gpt4all gpt4all copied to clipboard

[Feature] GPU support in docker-based API server

System Info

Information

Reproduction

Expected behavior

gpt4all
gpt4all copied to clipboard