KIVI
KIVI copied to clipboard
Why the model inference slowly when Mistral-7B-Instruct-v0.2 apply the kivi?