GenAIExamples icon indicating copy to clipboard operation
GenAIExamples copied to clipboard

Update VisualQnA example with Falcon VLM

Open arun-gupta opened this issue 1 year ago • 2 comments
trafficstars

Update VisualQnA example that uses Falcon VLM.

This would require to include Falcon as part of the validation at https://github.com/opea-project/GenAIComps/tree/main/comps/llms. And then create an updated VisualQnA that would use this microservice to use Falcon VLM.

arun-gupta avatar Aug 08 '24 22:08 arun-gupta

Supporting Falcon-11B would be great.

lucasmelogithub avatar Aug 09 '24 18:08 lucasmelogithub

TGI-Gaudi did not support this model: https://huggingface.co/tiiuae/falcon-11B-vlm

we need to wait for TGI-Gaudi

kevinintel avatar Aug 29 '24 08:08 kevinintel

@kevinintel can we move forward with this issue now?

chickenrae avatar Oct 31 '24 16:10 chickenrae

we can't do anything unless tgi-gaudi supports it

kevinintel avatar Nov 01 '24 07:11 kevinintel

@kevinintel can this be done using a larger Xeon instance?

arun-gupta avatar Nov 01 '24 15:11 arun-gupta

@kevinintel can this be done using a larger Xeon instance? image

VisualQnA works great on Xeon, I tested with a AWS "c7i.24xlarge" instance (96 vCPU 4th Gen Xeon w/ Intel AMX) using the llava-hf/llava-v1.6-mistral-7b-hf Model.

But when using tiiuae/falcon-11B-vlm, the "tgi-llava-xeon-server" microservice errors out.

lucasmelogithub avatar Nov 01 '24 19:11 lucasmelogithub

@kevinintel What are the next steps to resolve this issue?

chickenrae avatar Nov 03 '24 15:11 chickenrae

@arun-gupta @chickenrae

Thanks for create this topic. It's inactive for long time. Closet it now.

If anyone is interested in this topic, feel free to raise a new ticket.

xiguiw avatar Mar 14 '25 08:03 xiguiw