Hugo Zhang
[NeurIPS 2023] Model-enhanced Vector Index
HugoZHL
[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference