Prince Canuma
Prince Canuma
Thank you very much! We need to investigate this further but it's hard because I only speak Portuguese, Spanish, English, and a bit of Polish and Hindi. Could you help...
Hey @JoeJoe1313 @leoho0722 This is an issue on the transformers side It seems the Qwen2.5VLImageProcessor class was delete as it's identical with Qwen2VL. The fix is to either: 1. Change...
Thanks @neilmehta24! It will definetly be.
Could you share the specs of your machine?
I would recommend: 1. Trying 8bit or 4bit quants. 2. Trying the 2B version. 3. Or lowering the resolution further to 512 or 224
Awesome! It should work fine if you just lower the resolution. I have M3 Max with 96GB URAM. I can run this example in under a minute: https://github.com/Blaizzy/mlx-vlm/blob/62bb0ee2f57354de4cd27e42be593049269353a4/examples/video_generation.ipynb
> Ok, Thanks My pleasure!
Closing stale
Hey, I just tried it. It works well on demo samples but fails with custom UIs Check the screen resolution they are using and the prompting strategy
Qwen2vl needs to normalise their bbox to 1000