KoboldAI-Client

Can we have 8bit support?

Open oobabooga opened this issue 2 years ago • 2 comments

oobabooga, Jan 04 '23 16:01

See https://gist.github.com/whjms/2505ef082a656e7a80a3f663c16f4277

tensiondriven, Jan 31 '23 02:01

If this were added and worked on Colab, you could load models up to about 13B on a standard GPU. That way you wouldn't need TPUs to run bigger models, since they don't work at the moment. (oobabooga's Colab won't work on standard GPUs because it loads the shards into RAM and runs out of memory, but KoboldAI shouldn't have that problem.)

minipasila, Feb 24 '23 02:02
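
For a rough sense of why 8-bit loading would matter for 13B models, here is a back-of-the-envelope estimate of weight memory. It is only a sketch: it counts weights alone (no activations, KV cache, or framework overhead), and the 16 GiB GPU size is an assumption for illustration, not a figure from this thread.

```python
def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory needed just to hold the model weights, in GiB."""
    return num_params * bytes_per_param / 2**30


PARAMS_13B = 13e9        # nominal parameter count of a "13B" model
ASSUMED_GPU_GIB = 16.0   # assumption: a 16 GiB card; adjust for your GPU

fp16_gib = weight_memory_gib(PARAMS_13B, 2)  # 2 bytes per weight in fp16
int8_gib = weight_memory_gib(PARAMS_13B, 1)  # 1 byte per weight in int8

print(f"fp16 weights: {fp16_gib:.1f} GiB, fits: {fp16_gib < ASSUMED_GPU_GIB}")
print(f"int8 weights: {int8_gib:.1f} GiB, fits: {int8_gib < ASSUMED_GPU_GIB}")
```

Under these assumptions the fp16 weights alone exceed the card, while the int8 weights leave some headroom, which is roughly the point made above. Real usage will be higher than this estimate.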