multimodal-maestro
Bug at generate marks
Search before asking
- [X] I have searched the Multimodal Maestro issues and found no similar bug report.
Bug
Traceback (most recent call last):
File "/data/megvii/projects/Qwen-VL/scripts/test_maestro.py", line 7, in
Environment
Ubuntu 20.04 python=3.10.10
Minimal Reproducible Example
No response
Additional
No response
Are you willing to submit a PR?
- [ ] Yes I'd like to help by submitting a PR!
Hi @jyC23333! 👋🏻 Could you share the code you ran and the image you used? Basically, anything more that would help me understand what happens.
Hi @SkalskiP, my code is shown below:
import cv2
import maestro
image = cv2.imread('./鲫鱼.png')
generator = maestro.SegmentAnythingMarkGenerator(device='cuda')
marks = generator.generate(image=image)
marks = maestro.refine_marks(marks=marks)
mark_visualizer = maestro.MarkVisualizer()
marked_image = mark_visualizer.visualize(image=image, marks=marks)
Something goes wrong at this line:
marks = generator.generate(image=image)
original image:
Thanks @jyC23333. I'll get back to you!
Hi @jyC23333 👋🏻 I just tested using the latest version of maestro, and everything works. Take a look at this Colab. Make sure to use the 0.1.1rc1 version. Please let me know if that helped.
@SkalskiP Hi, the bug still exists with the latest version.
This is my cuda info:
And I'm using torch 2.1.0:
Hi @jyC23333, the bug doesn't exist in the Colab I sent you, so it must be environment-dependent. What versions of transformers and supervision do you have?
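For anyone comparing environments in a case like this, here is a minimal sketch of one way to collect the installed versions in a single step. It uses only the standard library's importlib.metadata. The helper name installed_versions is hypothetical, and the distribution names passed to it (maestro, torch, transformers, supervision) are assumed to match the names used on PyPI.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if it is missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed in this environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Distribution names below are assumptions; adjust them to your environment.
    report = installed_versions(["maestro", "torch", "transformers", "supervision"])
    for name, version in report.items():
        print(f"{name}: {version if version is not None else 'not installed'}")
```

Running the same script locally and in the Colab gives two lists that can be compared line by line.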
@SkalskiP Hi, the dependency info is:
To address the bug, add debug logs around self.pipeline in generate to check data shapes and integrity. Also monitor GPU usage to rule out running out of memory, since silent failures may not produce a clear error message.
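The logging suggestion could be sketched roughly as follows. This is an illustration only, not maestro's own code: describe_array and logged_call are hypothetical helpers that wrap any callable from the outside, so nothing inside the library needs to be edited. The sketch assumes the inputs expose shape and dtype attributes the way NumPy arrays do.

```python
import logging

logging.basicConfig(level=logging.DEBUG)
logger = logging.getLogger("maestro_debug")


def describe_array(name, value):
    """Return a one-line summary of an array-like value for debug logs."""
    if value is None:
        # cv2.imread returns None instead of raising when a file cannot be read.
        return f"{name}: None"
    shape = getattr(value, "shape", None)
    dtype = getattr(value, "dtype", None)
    return f"{name}: type={type(value).__name__}, shape={shape}, dtype={dtype}"


def logged_call(func, **kwargs):
    """Call func(**kwargs), logging the arguments first and the result or exception after."""
    for key, value in kwargs.items():
        logger.debug(describe_array(key, value))
    try:
        result = func(**kwargs)
    except Exception:
        # logger.exception records the full traceback before re-raising.
        logger.exception("call to %s failed", getattr(func, "__name__", repr(func)))
        raise
    logger.debug(describe_array("result", result))
    return result
```

With the reproduction script above, the failing line would become marks = logged_call(generator.generate, image=image). An image logged as None would point to a file-reading problem rather than to the generator. For the GPU side, torch.cuda.memory_allocated() can be logged before and after the call when PyTorch with CUDA is available.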
I'm very sorry, but I'm extremely busy. I'll try to look at it as soon as possible.
The maestro project has pivoted in the direction of a VLM fine-tuning toolkit. As a result, I am closing legacy issues.