llama-api

exllama GPU split

Open: atisharma opened this issue 9 months ago • 1 comment

It's not clear from the documentation how to split VRAM over multiple GPUs with exllama.

atisharma, Oct 27 '23 15:10