Yihua Cheng

Results 77 comments of Yihua Cheng

@soulseen Thanks for your interest! Right now we support the Xp1d setup. Please take a look at #528. We will update the documentation soon.

@soulseen Just a quick update: the documentation for Xp1d is now online at https://docs.lmcache.ai/disaggregated_prefill/nixl/xpyd.html (it's named XpYd, but only 1 decoder instance is supported for now)

@soulseen Seems like there are some problems with the underlying UCX connection (which is used by NIXL) ``` [2025-06-13 08:28:11,553] LMCache INFO: Storing KV cache for 5 out of 5...

@AsicDyc Hey, we don't use NIXL when doing CPU offloading. The high level difference is that TP=2 can have more GPU memory for KV cache than 2x TP=1, which means...

@AsicDyc Right now we only have NIXL for PD disaggregation. We do support 2x TP=1, but we require the prefiller and the decoder to have the same TP.

@lengrongfu Hey, we don't have this right now. What's the use case for using etcd?

@lengrongfu Right now we use ZMQ to exchange NIXL agent information directly between the NIXL agents.