Billy Cao comments

Results: 302 comments of Billy Cao

easyocr is useless in demo, why use it?

Yes, I can make a PR if you wish, I already have it impl in my fork: https://github.com/aliencaocao/OmniParser/commit/a3fe0e11c5b67a6ed8de916cdd77ddffba4d135f

easyocr is useless in demo, why use it?

Setting it to either 0.7 or 0.5 did not make things better for this particular example: ![Image](https://github.com/user-attachments/assets/1dd507f0-ff92-4945-9245-7d4ea75fba2c) I still think PaddleOCR is a clearly better alternative.

easyocr is useless in demo, why use it?

> > Yes, I can make a PR if you wish, I already have it impl in my fork: [aliencaocao@a3fe0e1](https://github.com/aliencaocao/OmniParser/commit/a3fe0e11c5b67a6ed8de916cdd77ddffba4d135f)
>
> Feel free to make a PR

PR made

Prebuilt wheel for Windows

Yep, can confirm `pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu124` on Linux also doesn't work; there is no wheel.

Add support for batched inference of images of same size

It could be CPU-bound because the implementation is essentially a lot of for loops over NumPy arrays. It is surely not as optimized as it could be. The perf...

onnx inference

You can use torch2onnx to convert the caption models yourself. The icon detector (YOLOv8) also supports ONNX export, as described in the Ultralytics docs.

onnx inference

OCR and the icon detector run in parallel. The results are merged and overlapping boxes are removed, prioritising OCR boxes. The remaining icon boxes are sent to the caption model. There are 2 models...
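
The merge step described above can be sketched roughly as follows. This is only an illustration, not the project's actual code: the function names `iou` and `merge_boxes`, the `(x1, y1, x2, y2)` box format, the use of IoU as the overlap measure, and the `0.5` threshold are all assumptions made for the example. The OCR and detector calls themselves are left out.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_boxes(
    ocr_boxes: List[Box],
    icon_boxes: List[Box],
    iou_threshold: float = 0.5,  # assumed value, not taken from the project
) -> Tuple[List[Box], List[Box]]:
    """Keep every OCR box; drop icon boxes that overlap any OCR box too much.

    Returns (ocr_boxes, remaining_icon_boxes). Only the remaining icon
    boxes would then be passed on to the caption model.
    """
    remaining = [
        icon
        for icon in icon_boxes
        if all(iou(icon, ocr) <= iou_threshold for ocr in ocr_boxes)
    ]
    return list(ocr_boxes), remaining
```

For example, with one OCR box `(0, 0, 10, 10)` and icon boxes `(1, 1, 10, 10)` and `(20, 20, 30, 30)`, the first icon box overlaps the OCR box heavily and is dropped, while the second is kept for captioning.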

Issue detecting some icons

PaddleOCR can fail on icons that look like a letter since it's character-based. You can try turning it off.

Add support for export SigLIP models

Hi @xenova, I see that you have already done it in https://huggingface.co/Xenova/siglip-large-patch16-384. May I know how you exported it, since it is not supported in Optimum yet?

Add support for export SigLIP models

Thanks. Do you know if this can be used with the HF pipeline?