Spoon

Results 2 comments of Spoon

Having this same issue for Llama3-8b. Trying to run it on a single GPU in 4-bit mode.