llama.cpp
llama.cpp copied to clipboard
llama : model-based max number of graph nodes
fix #8615
Propose to determine the max number of nodes based on the model info (arch, hparams, etc.)
- [x] I have read the contributing guidelines
- Self-reported review complexity:
- [ ] Low
- [ ] Medium
- [ ] High