QLLM icon indicating copy to clipboard operation
QLLM copied to clipboard

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

Results 1 QLLM issues
Sort by recently updated
recently updated
newest added

Thanks for your wonderful works. Is there a bug in: https://github.com/ModelTC/QLLM/blob/main/models/int_llama_layer.py#L291