Agent-S icon indicating copy to clipboard operation
Agent-S copied to clipboard

feat(grounding): add mixture-of-grounding fallback to generate_coords

Open jayeshthk opened this issue 9 months ago • 0 comments

  • Introduce mixture_generate_coords that first tries LLM-based grounding and falls back to OCR on failure for more robust coordinate lookup
  • Update assign_coordinates to use the new mixture method for click and drag-and-drop actions

This improves resilience when the primary grounding model cannot locate elements.

jayeshthk avatar Apr 10 '25 06:04 jayeshthk