gpt4all Add Llama-3.1-405B-FP8

Add Llama-3.1-405B-FP8

Open Pierre-Gronau-ndaal opened this issue 1 year ago • 5 comments

trafficstars

Feature Request

Jul 23 '24 20:07 Pierre-Gronau-ndaal

will you do us the honors of testing on your local machine

Jul 23 '24 20:07 AndriyMulyar

It will be a pleasure

Jul 24 '24 05:07 Pierre-Gronau-ndaal

We're working on this asap. It will work with the newest version which will be released shortly. Upstream llama.cpp is working on providing better support as well and when they do we'll be merging it in and making a new release as well.

Jul 24 '24 11:07 manyoso

Thank you! Got it working in the UI no problem! Then I grabbed the filename it downloaded when I added the model and tested it in the GPT4all ux and everything worked.

Then I replaced it in my code model = GPT4All("Meta-Llama-3.1-8B-Instruct.Q4_0.gguf")

and I get this error:

LLAMA ERROR: failed to load model from /Users/davidsmith/.cache/gpt4all/Meta-Llama-3.1-8B-Instruct.Q4_0.gguf LLaMA ERROR: prompt won't work with an unloaded model!

Jul 25 '24 20:07 davidsilvasmith

The python binding release has not been made to support it. Probably not till next week as python binding maintainer is on vacation.

Jul 25 '24 20:07 manyoso

Can be closed, I guess? If still doesn't work, you can re-open.

Mar 29 '25 10:03 ThiloteE

gpt4all gpt4all copied to clipboard

Add Llama-3.1-405B-FP8

Feature Request

gpt4all
gpt4all copied to clipboard