gpt-fast
gpt-fast copied to clipboard
Int4 perplexity
Hi! How does ppl compare between fp16 and your int4?