LLaVA icon indicating copy to clipboard operation
LLaVA copied to clipboard

Production deployment

Open ArunAniyan opened this issue 1 year ago • 1 comments

Question

Hi, What is the best infrastructure and methodology to deploy llava for a production-grade application? Is a local application server like Ollama advisable? Do you know of other possible methods? Apart from ollama, llama.cpp is something that comes to mind. Have not tried triton-llm.

ArunAniyan avatar Feb 09 '24 11:02 ArunAniyan

Sglang supports llava https://github.com/sgl-project/sglang

nivibilla avatar Feb 10 '24 08:02 nivibilla