lijiabao2

Results 2 comments of lijiabao2

My machine has a 16GB GPU and I also encountered this problem. It works perfectly after using 4-bit quantization. I ran through the author's code in this way.

You can find the answer you want in this article: "simple is effective: the roles of..."