int1 topic
int1 repositories
neural-speed
273 Stars
31 Forks
Watchers
An innovative library for efficient LLM inference via low-bit quantization