junyan-zg
Results
2
comments of
junyan-zg
Successfully verified on v0.17.0 !
@bbss Is the dynamic quant one working now ? ` >>> from vllm import LLM INFO 03-20 12:43:57 [__init__.py:256] Automatically detected platform cuda. >>> import torch >>> model_id = "/opt/Qwen2.5-VL-72B-Instruct-unsloth-bnb-4bit"...