vllm
[New Model]: Google's PaliGemma family of models
The model to consider.
https://huggingface.co/google/paligemma-3b-pt-896
The closest model vllm already supports.
I think the only vision-language model supported right now is LLaVA, but I could be wrong.
What's your difficulty of supporting the model you want?
No response
MiniCPM is also supported.
Excited to test out how PaliGemma compares, especially when analyzing GUI images: https://github.com/OpenAdaptAI/OpenAdapt/issues/637
I'm currently working on a PR for this. See #5189.