jan
bug: Models larger than 10 GB don't work correctly
Describe the bug: In the attached video I run the Phind 34B model to show the bug.
Steps to reproduce the behavior:
- Download any recommended model larger than 10 GB and smaller than 30 GB
- Click on 'Regenerate' or start a new thread
- The model does not work correctly. Once launched, generation is very slow, and the output is either an empty line or incoherent words. This happens regardless of the model; what matters is that it is larger than 10 GB and smaller than 30 GB.
Expected behavior: A model from the recommended list runs and works correctly.
Environment details
- Operating System: [MacOS Sonoma 14.2.1]
- Jan Version: [0.4.5]
- Processor: [Apple M1]
- RAM: [32GB]
Additional context
https://github.com/janhq/jan/assets/36083130/58fa81df-8c3e-487c-8c3e-6f2ef18b38f3
Please help us by providing the app.log for troubleshooting: https://jan.ai/troubleshooting/how-to-get-error-logs/
app.log My app log
Hi there,
Maybe the problem is caused by not having enough memory.
I tested with the model phind-codellama-34b-v2.Q5_K_M.gguf on my MacBook Pro 64GB RAM and it works without errors.
Here is my configuration.
Environment details
- MacBook Pro
- Operating System: [MacOS Sonoma 14.3]
- Jan Version: [0.4.5]
- Processor: [Apple M1 Max]
- RAM: [64GB]
Additional context: I have attached my app.log (phind34b_app.log) for comparison. I hope it helps with the analysis.
In @raveismadly's app.log you can see repeated NITRO errors after update_slots, e.g. after lines 4553, 5067, 5283, 5309, etc.
[NITRO]::Error: ggml_metal_graph_compute: command buffer 5 failed with status 5
You can't see these errors in my log file.
Kind regards
Matthias
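The log comparison described above can be automated. Below is a minimal Python sketch that scans log text for the error line quoted in this comment. The regular expression is derived only from that one quoted line, the function name `find_metal_failures` is hypothetical, and the sample input uses placeholder lines rather than real Jan output.

```python
import re

# Pattern built from the single error line quoted in this thread.
# Other variants of the message may exist; this is an assumption.
METAL_ERROR = re.compile(
    r"ggml_metal_graph_compute: command buffer (\d+) failed with status (\d+)"
)


def find_metal_failures(log_text):
    """Return (line_number, buffer_index, status) for each matching line."""
    failures = []
    for line_number, line in enumerate(log_text.splitlines(), start=1):
        match = METAL_ERROR.search(line)
        if match:
            failures.append(
                (line_number, int(match.group(1)), int(match.group(2)))
            )
    return failures


# Placeholder lines are illustrative only; the middle line is the one quoted above.
sample_log = "\n".join([
    "placeholder line (not real Jan output)",
    "[NITRO]::Error: ggml_metal_graph_compute: command buffer 5 failed with status 5",
    "placeholder line (not real Jan output)",
])

for line_number, buffer_index, status in find_metal_failures(sample_log):
    print(f"line {line_number}: command buffer {buffer_index}, status {status}")
```

To check a real log, read the file and pass its contents in, for example `find_metal_failures(open("app.log", encoding="utf-8", errors="replace").read())`. An empty result would match the log from the 64GB machine, where these errors do not appear.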
Hi @Van-QA What's the cause of this?
Hi, we will investigate this issue as an OOM (out-of-memory) case. Thank you.
It seems the problem is that you are running a model that is too big for your RAM. At this point, it is our fault that we don't deliver a clear error message. This is addressed in the following ticket, where you can track the progress: https://github.com/janhq/jan/issues/1627
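Until a clearer error message exists, a rough pre-check along these lines can hint at whether a model is likely too big. This is only a sketch and not how Jan decides anything: the function name, the `usable_fraction` of RAM available to the model, and the fixed `overhead_bytes` for context and buffers are all assumed values chosen for illustration, and the 24 GiB model size is hypothetical.

```python
GIB = 1024 ** 3


def fits_in_memory(model_bytes, total_ram_bytes,
                   usable_fraction=0.7, overhead_bytes=2 * GIB):
    """Rough heuristic: model size plus overhead must fit in a share of RAM.

    usable_fraction and overhead_bytes are illustrative assumptions,
    not values taken from Jan, Nitro, or llama.cpp.
    """
    budget = total_ram_bytes * usable_fraction
    return model_bytes + overhead_bytes <= budget


# Hypothetical 24 GiB model file on the two machines from this thread.
model_size = 24 * GIB
for ram_gib in (32, 64):
    verdict = "likely fits" if fits_in_memory(model_size, ram_gib * GIB) else "likely too big"
    print(f"{ram_gib} GiB RAM: {verdict}")
```

With these assumed numbers the check fails on 32 GiB and passes on 64 GiB, which lines up with the two reports above, but the real limit depends on the quantization, context size, and what else is running.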