stable-diffusion.cpp
stable-diffusion.cpp copied to clipboard
Support ternary models
ggml-quants : ternary packing for TriLMs and BitNet b1.58 #8151 is on the horizon
There are TerDiT: Ternary Diffusion Models
4.2B of ternary weights packed lossless:
- 2,000 bpw - 1.05GB
- 1,625 bpw - 914 MB