pyllama icon indicating copy to clipboard operation
pyllama copied to clipboard

12GB card

Open arthurwolf opened this issue 1 year ago • 2 comments

My card has 12GB of RAM, that's not a case covered anywhere i could see. Would this allow me to do more (run the larger models, etc)? Any chances to get instructions for larger cards?

Thanks!

arthurwolf avatar Jul 30 '23 23:07 arthurwolf

You can run Quantized 7B model on your pc, but with the full version which is the not quantized version of 7B, you won't be able to run it, because it will literally eat 12G of RAM. Just use a cloud server or just use the quantized version if you want to explore prompt engineering.

miko8422 avatar Jan 01 '24 03:01 miko8422

You can run Quantized 7B model on your pc, but with the full version which is the not quantized version of 7B, you won't be able to run it, because it will literally eat 12G of RAM. Just use a cloud server or just use the quantized version if you want to explore prompt engineering.

I also have 12G of RAM on my own pc, but I wasn't been able to run the offical 7B model. By the way, I'm using 4070... I start to hate why I didn't bought the 4090 to run this model because I wan't to do some tuning on it.

miko8422 avatar Jan 01 '24 03:01 miko8422