Isotr0py comments

Results 139 comments of


                                            Isotr0py

[Misc] Use model_overwrite to redirect the model name to a local folder.

> I think the config is too complicated, and it will also request hf multiple times. I think we just need to redirect to local location when initializing `ModelConfig`, so...

[Misc] Use model_overwrite to redirect the model name to a local folder.

> Otherwise, log output and server name will be very weird. IMO, if we redirect the model_repo to local directory manually, we should also make sure the model's name updated...

[Misc] Use model_overwrite to redirect the model name to a local folder.

> 1. Redirection to local location when initializing ModelConfig. Log output and server names will be very weird. It may even trigger strange bugs @noooop I prefer 1 TBH. Redirection...

[Misc] Use model_overwrite to redirect the model name to a local folder.

The entrypoint test failure should be unrelated, I can confirm it's passed locally. The V1 test is flaky currently. 😅

[Bug] vllm deploy InternVL3_5-241B-A28B error

> same err here > The error can be resolved by installing the specific version vllm==0.10.1.1. Sorry for breaking this. https://github.com/vllm-project/vllm/pull/25146 should fix this.

[Bug] vllm deploy InternVL3_5-241B-A28B error

@Journey7331 I tried but unable to reproduce the tensor schema issue on vLLM's current main branch. The processed pixvel_values from `OpenGVLab/InternVL3_5-30B-A3B-Instruct` indeed has 448x448 size per patch on my side....

[Bug] vllm deploy InternVL3_5-241B-A28B error

Hmmm, I remember that pytorch 2.8 has deprecated cu118 support...

[Bug] vllm deploy InternVL3_5-241B-A28B error

> BTH, pixel_values error only exists with ckpt downloaded from OpenGVLab/InternVL3_5-30B-A3B-HF HF version Oh, I see. HF format models use a different model implementation compared to GitHub format, will take...

[New Model]: Florence-2

Oh, I totally forgot this... 😅 Let me port the ViT for the florence models to finish this.

[Feature]: Application support for the InternVL2.5-78B series models.

This should has been supported, you can try serving the AWQ model with this command: ```shell vllm serve OpenGVLab/InternVL2_5-78B-AWQ --quantization awq --dtype half --max-model-len 4096 ```