self-operating-computer
self-operating-computer copied to clipboard
feat: integrate omniparserv2 api
What does this PR do?
- support OmniparserV2 official api
- integrate Omniparser with Qwen
Requirement/Documentation
- Omniparser api: https://github.com/microsoft/OmniParser/blob/master/omnitool/omniparserserver/omniparserserver.py
Type of change
- [x] New feature (non-breaking change which adds functionality)
Mandatory Tasks
- [x] Make sure you have self-reviewed the code. A decent size PR without self-review might be rejected. Make sure before submmiting this PR you run tests with evaluate.py
@joshbickett would someone have a moment to look at this? It's a really small PR that could be useful to anyone trying to integrate Omniparserv2, and I add a Qwen-VL integration as a demo as well. Thanks in advance!