Woo-Yeon Lee

Results 5 comments of Woo-Yeon Lee

Great! Thanks for the PR! I have much interest in vllm and this speculative decoding feature. Actually, I didn't know this was already in progress, and I was testing the...

After a short experiment with this PR code, I'm getting almost the same result with my local branch, based on v0.4.2. I guess that it's due to the comm overhead...

@LiuXiaoxuanPKU Thank you (and other co-workers) for the great work! I checked the SmartSpec paper on arxiv and slides in the recent meetup, and it looks great :) I'm looking...

@cadedaniel Can I contribute my code that already implemented this feature on v0.4.2? I've referred to your code in #2188. I'm aware that #4933 is going on, so I want...