junyan-zg

Results 2 comments of junyan-zg

Successfully verified on v0.17.0 !

@bbss Is the dynamic quant one working now ? ` >>> from vllm import LLM INFO 03-20 12:43:57 [__init__.py:256] Automatically detected platform cuda. >>> import torch >>> model_id = "/opt/Qwen2.5-VL-72B-Instruct-unsloth-bnb-4bit"...