vision-agent
Support Multi-Class/Object Detection in a Single Call
The current implementation supports detecting only one object type per call. Real-world applications often need to detect multiple object types simultaneously. This limitation forces users to make one API call per object type in sequence, which is inefficient and wastes resources.
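To make the request concrete, here is a minimal sketch of the difference between the sequential workaround and a single multi-class call. Everything in it is hypothetical and for illustration only: `Detection`, `detect_sequential`, `detect_multi`, and `CountingBackend` are not part of vision-agent's API, and the backend is a stub that returns fake boxes so the number of calls can be counted.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple


@dataclass(frozen=True)
class Detection:
    """Hypothetical result type: one detected object."""
    label: str
    score: float
    bbox: Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


class CountingBackend:
    """Stub detector (an assumption, not a real model).

    Returns one fake box per requested label and counts how many
    times it was invoked.
    """

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, image: object, labels: Sequence[str]) -> List[Detection]:
        self.calls += 1
        return [Detection(label, 0.9, (0.0, 0.0, 1.0, 1.0)) for label in labels]


def detect_sequential(backend, image: object, labels: Sequence[str]) -> List[Detection]:
    """Current workaround: one backend call per object type."""
    results: List[Detection] = []
    for label in labels:
        results.extend(backend(image, [label]))
    return results


def detect_multi(backend, image: object, labels: Sequence[str]) -> Dict[str, List[Detection]]:
    """Requested behaviour: one backend call for all object types,
    with results grouped by label."""
    grouped: Dict[str, List[Detection]] = {label: [] for label in labels}
    for det in backend(image, list(labels)):
        grouped.setdefault(det.label, []).append(det)
    return grouped


if __name__ == "__main__":
    labels = ["car", "person", "dog"]

    sequential_backend = CountingBackend()
    detect_sequential(sequential_backend, None, labels)

    multi_backend = CountingBackend()
    detect_multi(multi_backend, None, labels)

    print("sequential calls:", sequential_backend.calls)
    print("multi-class calls:", multi_backend.calls)
```

With the stub backend, the sequential path makes one call per label while the multi-class path makes a single call, which is the saving this request is after. Whether a real detector can serve several labels in one pass depends on the underlying model.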