BaaBaa

Results 1 comments of BaaBaa

Hi all, I try to use speculative decoding with tp=8 and pp=2 on the 2 x 8H20 testbed, with following command: ``` vllm serve /vllm-workspace/DeepSeek-R1/ \ --host 0.0.0.0 \ --port...