int7 topic
List
int7 repositories
neural-speed
273
Stars
31
Forks
Watchers
An innovative library for efficient LLM inference via low-bit quantization
An innovative library for efficient LLM inference via low-bit quantization