int3 topic
List
int3 repositories
neural-speed
350
Stars
39
Forks
350
Watchers
An innovative library for efficient LLM inference via low-bit quantization