LLaVA-NeXT Demo deployment issues

Demo deployment issues

Open pedrocolon93 opened this issue 1 year ago • 4 comments

Hi there! I cloned the weights with: git clone https://huggingface.co/lmms-lab/llava-next-interleave-7b and followed the setup in the readme as is (it also needs a gradio upgrade, pip install --upgrade gradio, and numpy==1.23.0). When I run inference in gradio (with the examples), I get garbage output. Is there anything I am missing?

Jun 28 '24 23:06 pedrocolon93

As a side note, flash attention also needs to be installed: pip install flash-attn

Jun 28 '24 23:06 pedrocolon93

As a second side note, the same thing happens with the -dpo model.

Jun 28 '24 23:06 pedrocolon93

Fixed this by cloning the repo and adding -qwen- to the name of the cloned repo... Otherwise it loads some other LLaVA architecture, which does not work.

Jun 29 '24 00:06 pedrocolon93
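The rename fix suggests that the loader picks an architecture from the model name. The thread does not show the loader code, so the following is only a minimal sketch of that idea under that assumption; the function name pick_architecture and the returned labels are hypothetical, not the project's API.

```python
def pick_architecture(model_path: str) -> str:
    """Hypothetical stand-in for name-based dispatch in a model loader."""
    # Inspect only the last path component, lowercased.
    name = model_path.rstrip("/").split("/")[-1].lower()
    if "qwen" in name:
        return "qwen"
    # Fallback branch: any name without a known marker lands here.
    return "default-llava"


# The original directory name has no "qwen" marker, so it hits the fallback.
print(pick_architecture("lmms-lab/llava-next-interleave-7b"))
# After renaming the local directory, the "qwen" branch is selected.
print(pick_architecture("./llava-next-interleave-qwen-7b"))
```

If the real loader works this way, the weights themselves are fine and only the directory name decides which model class is used to read them, which would explain the garbage output.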

If loading in 4 bit, the line kwargs["load_in_4bit"] = True in builder.py needs to be commented out.

Jun 29 '24 00:06 pedrocolon93
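One possible reading of the 4-bit note, sketched under stated assumptions: the loader may set both a bare load_in_4bit flag and a separate quantization config, and commenting out the flag leaves only the config. The thread does not show builder.py, so the function build_load_kwargs, the keep_flag parameter, and the "quantization_config" placeholder below are all hypothetical; only the kwargs["load_in_4bit"] = True line comes from the thread.

```python
def build_load_kwargs(load_4bit: bool, keep_flag: bool = False) -> dict:
    """Hypothetical sketch of how loader kwargs might be assembled."""
    kwargs = {}
    if load_4bit:
        if keep_flag:
            # The line the thread says to comment out.
            kwargs["load_in_4bit"] = True
        # Placeholder for a quantization config object (assumption: the
        # real loader passes a library-specific config here, not a dict).
        kwargs["quantization_config"] = {"bits": 4}
    return kwargs


# With the flag line commented out, only the config is passed along.
print(build_load_kwargs(load_4bit=True))
# With the flag line kept, both entries are passed.
print(build_load_kwargs(load_4bit=True, keep_flag=True))
```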

You should change the model path from llava-next-interleave-7b to llava-next-interleave-qwen-7b and try again.

Jul 01 '24 23:07 HaoZhang534

Thanks. Please also double check the following: if loading in 4 bit, the line kwargs["load_in_4bit"] = True in builder.py needs to be commented out, and the flash-attn dependency needs to be added.

Jul 01 '24 23:07 pedrocolon93
