Taige Wang

Results 1 comments of Taige Wang

~~Same issue here. Windows 11, Cuda 11.8/12.3, Python 3.12/3.11, model `llama-2-13b-chat.Q8_0.gguf`, same output.~~ Update: Got it fixed. It turns out that my CPU does not support AVX2, so I cloned...