SnapKV
SnapKV copied to clipboard
Can snapkv compress kv in case different user questions are posed towards the same context?
Say there is a long document, then two users ask two different questions based on the document. These two questions are no way similar, targeting on different part of the document. In this case, can snapkv compress the context robustly?