AinL

Results 14 comments of AinL
trafficstars

Now I understand that the concept of hierarchical coaching is trying to aim more general framework than what I thought. I will keep watching this PR, and I will implement...

@Edenzzzz Yes, I used cudaMemAdvise to make the pages stay mostly in the CPU. So, if what I understand is correct, the latency should be stable from the CPU side....

>If you are on the latest version of vLLM you'll need Pytorch 2.4. Thank you for reply. I am recompiling vLLM from source with PyTorch 2.3.0, because I need to...