TEN-Agent icon indicating copy to clipboard operation
TEN-Agent copied to clipboard

[FEATURE] Hope to add the feature of multimodal or vision understanding through IP camera

Open guihaoqun opened this issue 9 months ago • 1 comments

Description

The ten framework can implement the local camera to obtain the video stream, but the reality is more about IP cameras,so Hope to add the feature of multimodal or vision understanding through IP camera

Severity

Critical

Additional Information

nothing

guihaoqun avatar Mar 27 '25 07:03 guihaoqun

could you pls describe the use case in more detail? how will you interact with agent? via the mic on ip camera?

plutoless avatar Mar 31 '25 03:03 plutoless