kaizen icon indicating copy to clipboard operation
kaizen copied to clipboard

ENH: Add support for media

Open MashyBasker opened this issue 1 year ago • 2 comments

Enhance our PR description and issue label generators to support multimedia inputs (screenshots, videos, GIFs) in addition to text, for more comprehensive content analysis.

MashyBasker avatar Aug 12 '24 17:08 MashyBasker

Are we using any vision models as of now?

SwapnilChand avatar Sep 11 '24 22:09 SwapnilChand

I don't think so. Maybe we could use LLaVA?

MashyBasker avatar Sep 12 '24 05:09 MashyBasker