llama.cpp icon indicating copy to clipboard operation
llama.cpp copied to clipboard

llama : model-based max number of graph nodes

Open ggerganov opened this issue 7 months ago • 1 comments

fix #8615

Propose to determine the max number of nodes based on the model info (arch, hparams, etc.)

ggerganov avatar Jul 22 '24 06:07 ggerganov