int7 topic

List int7 repositories

neural-speed

273
Stars
31
Forks
Watchers

An innovative library for efficient LLM inference via low-bit quantization