OmniParser
OmniParser copied to clipboard
Do we have finetuned models for more accurate icon classification?
The icon detection is supper great, however the icon classification ( parsed_content_list 'content' field is not so accurate )
If more accurate icon detection result we can feed to LLM directly to generate actions.