ppl.nn.llm icon indicating copy to clipboard operation
ppl.nn.llm copied to clipboard

怎么设置 kv cache int8 量化, 但 a 和 w 仍然是f16,测试 kvcache 量化的收益

Open seeyourcell opened this issue 1 year ago • 4 comments

seeyourcell avatar Oct 24 '23 11:10 seeyourcell