int2 topic

List int2 repositories

neural-speed

350
Stars
39
Forks
350
Watchers

An innovative library for efficient LLM inference via low-bit quantization