Harry Mellor

298 comments by Harry Mellor

Can you please tell us the commit hash of the snapshot of R1 you have downloaded? DeepSeek have been making small changes to the `json` files so it'd be good...

@dernosmirc I can't seem to install OpenVINO on my dev machine because it has CUDA devices in it. Can you confirm if #14271 solves the issue?

I know I should be able to, but when I try to run anything I get:
```
RuntimeError: Only one platform plugin can be activated, but got: ['openvino', 'cuda']
```

Sorry, it was probably too early to ask you to try it. The error you saw was legitimate. If you have time to test the latest version of that PR...

Looks good from a docs perspective now: https://vllm--12010.org.readthedocs.build/en/12010/features/quantization/inc.html

It ultimately came down to architectural complexity in vLLM. Whisper is still supported, and multimodal models that do not use cross-attention are still supported.

FYI, this model already works with vLLM:
```console
vllm serve XiaomiMiMo/MiMo-7B-Base --model-impl transformers --trust-remote-code
```