LLaVA-NeXT
LLaVA-NeXT copied to clipboard
How to deploy this model via API?
How do we deploy this model via API? Can I deploy it on vLLM or lmdeploy? I can't find any example to run this with HuggingFace transformers.
I want to deploy 72b and 110b model
Hi @Iven2132 , see that you have already noticed our PR in sglang.
For others that have similar problems and reading this issue, you can refer to here
Would be ideal to have TGI and vLLM support as well.
I tried TGI but it seems that won't work as the model isn't recognised.