feat: integrate omniparserv2 api

Open hetaoBackend opened this issue 9 months ago • 1 comment

What does this PR do?

  • support OmniparserV2 official api
  • integrate Omniparser with Qwen

Requirement/Documentation

  • Omniparser api: https://github.com/microsoft/OmniParser/blob/master/omnitool/omniparserserver/omniparserserver.py
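
For reviewers unfamiliar with the linked server, here is a minimal sketch of how a client might call it. It is not the code in this PR. It assumes the server exposes a `POST /parse/` endpoint that accepts a JSON body with a `base64_image` field and responds with JSON, and that it listens on `http://localhost:8000`; check both against the linked `omniparserserver.py`. The helper names `build_parse_request` and `parse_screenshot` are hypothetical.

```python
import base64
import json
import urllib.request

# Assumption: default address of a locally running omniparserserver.
DEFAULT_BASE_URL = "http://localhost:8000"


def build_parse_request(image_bytes: bytes,
                        base_url: str = DEFAULT_BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a base64-encoded screenshot.

    Assumption: the endpoint path is /parse/ and the field is `base64_image`.
    """
    payload = json.dumps(
        {"base64_image": base64.b64encode(image_bytes).decode("ascii")}
    ).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/parse/",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_screenshot(image_bytes: bytes,
                     base_url: str = DEFAULT_BASE_URL,
                     timeout: float = 60.0) -> dict:
    """Send the screenshot to the server and return the decoded JSON response."""
    request = build_parse_request(image_bytes, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

The parsed elements in the response could then be passed to a vision-language model such as Qwen-VL as text context alongside the screenshot. How this PR wires that up is defined by its diff, not by this sketch.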

Type of change

  • [x] New feature (non-breaking change which adds functionality)

Mandatory Tasks

  • [x] Make sure you have self-reviewed the code. A decent-sized PR without self-review might be rejected. Before submitting this PR, make sure you run the tests with evaluate.py

hetaoBackend avatar Mar 04 '25 08:03 hetaoBackend

@joshbickett would someone have a moment to look at this? It's a really small PR that could be useful to anyone trying to integrate OmniParser V2, and I also added a Qwen-VL integration as a demo. Thanks in advance!

hetaoBackend avatar Mar 04 '25 08:03 hetaoBackend