LLaVA-NeXT icon indicating copy to clipboard operation
LLaVA-NeXT copied to clipboard

Demo deployment issues

Open pedrocolon93 opened this issue 1 year ago • 4 comments

Hi there! I clone the tensors here: git clone https://huggingface.co/lmms-lab/llava-next-interleave-7b And I do the setup as is in the readme (which needs an upgrade for gradio (pip install --upgrade gradio) and needs numpy==1.23.0) and when i do inference in gradio (with the examples) I get garbage. Is there anything I am missing? image

pedrocolon93 avatar Jun 28 '24 23:06 pedrocolon93

As a side note also needs to install flash attention: pip install flash-attn

pedrocolon93 avatar Jun 28 '24 23:06 pedrocolon93

As a second side note same thing for the -dpo model.

pedrocolon93 avatar Jun 28 '24 23:06 pedrocolon93

FIxed this by cloning the repo and adding -qwen- in the name of the repo... Otherwise it loads some other Llava architecture which does not work.

pedrocolon93 avatar Jun 29 '24 00:06 pedrocolon93

If loading in 4 bit, the line in builder.py kwargs["load_in_4bit"] = True needs to be commented.

pedrocolon93 avatar Jun 29 '24 00:06 pedrocolon93

You should change the model path from llava-next-interleave-7b to llava-next-interleave-qwen-7b and try again.

HaoZhang534 avatar Jul 01 '24 23:07 HaoZhang534

Thanks, and double check the: If loading in 4 bit, the line in builder.py kwargs["load_in_4bit"] = True needs to be commented and adding in the flash-attn dependency

pedrocolon93 avatar Jul 01 '24 23:07 pedrocolon93