int2 topic
int2 repositories
neural-speed
Stars: 350 | Forks: 39 | Watchers: 350
An innovative library for efficient LLM inference via low-bit quantization