llama.cpp icon indicating copy to clipboard operation
llama.cpp copied to clipboard

Feature Request: Support for Qwen2-VL

Open isr431 opened this issue 1 year ago • 130 comments

Prerequisites

  • [X] I am running the latest code. Mention the version if possible as well.
  • [X] I carefully followed the README.md.
  • [X] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • [X] I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Qwen just released Qwen2-VL 2B & 7B under the Apache 2.0 License.

Motivation

SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. Understanding videos of 20min+: Qwen2-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc.

Possible Implementation

No response

isr431 avatar Aug 29 '24 22:08 isr431

+1 This would be another great addition!

chigkim avatar Aug 31 '24 00:08 chigkim

This model is awesome

crzroot avatar Aug 31 '24 02:08 crzroot

I am looking forward to it very much

suepradun avatar Aug 31 '24 03:08 suepradun

+1 I am looking forward to it very much

xzlinux avatar Aug 31 '24 12:08 xzlinux

We can try Llamafing it

yukiarimo avatar Aug 31 '24 23:08 yukiarimo

+1

XDesktopSoft avatar Sep 01 '24 02:09 XDesktopSoft

+1

WildCatApp avatar Sep 01 '24 11:09 WildCatApp

+1

uestcbraid avatar Sep 02 '24 01:09 uestcbraid

+1

mrhalyang avatar Sep 02 '24 02:09 mrhalyang

+1

elyzionz avatar Sep 02 '24 05:09 elyzionz

+1

eaucoin avatar Sep 02 '24 20:09 eaucoin

+1

Kimizhao avatar Sep 03 '24 03:09 Kimizhao

+1

enryteam avatar Sep 04 '24 09:09 enryteam

Any updates?

yukiarimo avatar Sep 04 '24 10:09 yukiarimo

+1

apipino avatar Sep 05 '24 01:09 apipino

+1

Xhehab avatar Sep 05 '24 04:09 Xhehab

+1

Seaman3body avatar Sep 05 '24 13:09 Seaman3body

+1

zenoverflow avatar Sep 05 '24 15:09 zenoverflow

+1

whoisltd avatar Sep 06 '24 03:09 whoisltd

+1

eav-solution avatar Sep 07 '24 16:09 eav-solution

I can not wait for it !!!

feynmanloo avatar Sep 08 '24 16:09 feynmanloo

Maybe people should also express interest and ask Qwen2-VL devs to implement. https://github.com/QwenLM/Qwen2-VL/issues/7

chigkim avatar Sep 08 '24 19:09 chigkim

Expect to use llama.cpp end side inference

wmx-github avatar Sep 11 '24 01:09 wmx-github

Is anyone already working on this? If not, I would like to give it a try.

HimariO avatar Sep 11 '24 02:09 HimariO

+1 is there any updates?

solangii avatar Sep 11 '24 08:09 solangii

+1

PredyDaddy avatar Sep 12 '24 09:09 PredyDaddy

+1

shobhit9618 avatar Sep 12 '24 12:09 shobhit9618

+1

zhouxihong1 avatar Sep 13 '24 08:09 zhouxihong1

+1

gaurishmehra avatar Sep 15 '24 15:09 gaurishmehra

Could everybody please stop the +1 comments and instead just give a 👍🏽 on the first post? The +1 posts just add noise and make following any real updates annoying...

nightscape avatar Sep 15 '24 20:09 nightscape