gpt4all icon indicating copy to clipboard operation
gpt4all copied to clipboard

Add Llama-3.1-405B-FP8

Open Pierre-Gronau-ndaal opened this issue 1 year ago • 5 comments
trafficstars

Feature Request

Pierre-Gronau-ndaal avatar Jul 23 '24 20:07 Pierre-Gronau-ndaal

will you do us the honors of testing on your local machine

AndriyMulyar avatar Jul 23 '24 20:07 AndriyMulyar

It will be a pleasure

Pierre-Gronau-ndaal avatar Jul 24 '24 05:07 Pierre-Gronau-ndaal

We're working on this asap. It will work with the newest version which will be released shortly. Upstream llama.cpp is working on providing better support as well and when they do we'll be merging it in and making a new release as well.

manyoso avatar Jul 24 '24 11:07 manyoso

Thank you! Got it working in the UI no problem! Then I grabbed the filename it downloaded when I added the model and tested it in the GPT4all ux and everything worked.

Then I replaced it in my code model = GPT4All("Meta-Llama-3.1-8B-Instruct.Q4_0.gguf")

and I get this error:

LLAMA ERROR: failed to load model from /Users/davidsmith/.cache/gpt4all/Meta-Llama-3.1-8B-Instruct.Q4_0.gguf LLaMA ERROR: prompt won't work with an unloaded model!

davidsilvasmith avatar Jul 25 '24 20:07 davidsilvasmith

The python binding release has not been made to support it. Probably not till next week as python binding maintainer is on vacation.

manyoso avatar Jul 25 '24 20:07 manyoso

Can be closed, I guess? If still doesn't work, you can re-open.

ThiloteE avatar Mar 29 '25 10:03 ThiloteE