LLaVA-NeXT
LLaVA-NeXT copied to clipboard
Do we have some inference accelerate method for new llava-next-video models?
Hi, Amazing job for new llava-next-video model! Since it has 34B params and maybe need more than 1 GPU, so do we have support some inference accelerate method for new llava-next-video models? like sglang deploy. Thanks~