lijiabao2
Results
2
comments of
lijiabao2
My machine has a 16GB GPU and I also encountered this problem. It works perfectly after using 4-bit quantization. I ran through the author's code in this way.
You can find the answer you want in this article: "simple is effective: the roles of..."