OmniParser icon indicating copy to clipboard operation
OmniParser copied to clipboard

Dataset availability

Open nmstoker opened this issue 1 year ago • 9 comments

Thank you for this impressive work -really interesting.

I had a look at the paper, blog post and here, but I cannot see any indication of where the dataset is published - is it available?

I was interested in checking the dataset because I'm seeing several cases where "OK" buttons get classified as icons wth "a button to close or cancel an action" which seems semantically incorrect: OK is more about proceeding, yet this suggests not proceeding.

It would be great to explore examples in the dataset and potentially add more to try to reduce this outcome, but obviously one needs access before that's feasible 🙂

If it's not available now but you plan to put it out, would be great if you could share a rough timeframe (eg just a few days, weeks or longer).

Many thanks!

nmstoker avatar Oct 27 '24 12:10 nmstoker

@nmstoker the ~~dataset~~ weights are automatically downloaded in download.py in https://github.com/microsoft/OmniParser/pull/52. This downloads the weights from https://huggingface.co/microsoft/OmniParser.

Edit: correction

abrichr avatar Oct 30 '24 03:10 abrichr

This isnt the dataset but weights.

aliencaocao avatar Oct 30 '24 03:10 aliencaocao

@yadong-lu - do you have any details regarding dataset availability?

nmstoker avatar Nov 06 '24 01:11 nmstoker

Hi @yadong-lu - I saw you commented a few days back on this matter here

It's great that options are being explored and I appreciate this likely needs time to work through internal processes.

Do you have a rough idea how long it might reasonably take? Eg a few more weeks or is it more like two or three months?

Would be good to keep up the momentum whilst there's plenty of attention on this exciting research, but I totally get how large companies can be 🙂

nmstoker avatar Nov 16 '24 12:11 nmstoker

Yeah large companies often delay the release of some things if it contains proprietary data.

They might be cleaning it up for the release, the wait is good but the delay is bad 😞

Meshwa428 avatar Nov 17 '24 18:11 Meshwa428

Still waiting on 2025/2/22

Carol-gutianle avatar Feb 22 '25 08:02 Carol-gutianle

Still waiting on 2025/3/17

rogerslh avatar Mar 17 '25 12:03 rogerslh

Not worth the wait. Just run omni parser 2 on your GPU or get some cloud options. Get a dataset which contains GUI images and just run it through the model and you can have a dataset with omni parser format which is 60%(maybe ~76%) accurate

Meshwa428 avatar Mar 17 '25 12:03 Meshwa428

still waiting