crewAI MLLM Support

MLLM Support

Open Ryan-ZL-Lin opened this issue 1 year ago • 3 comments

Hi First of all, thanks for your amazing work here. I’m wondering whether CrewAI would support any MLLM (multimodal large language model)? Since it’s more suitable in my use case. For example, the agent can take a image from somewhere and invoke the API from any MLLM to understand the meaning of this image.

Jan 21 '24 05:01 Ryan-ZL-Lin

I'm a new user who's experimented with CrewAI a little and haven't tried it but why not? If you tell it what image to ingest and specify the multimodal model to use then I don't see why not. Try it and report back what you see if you have questions, would be curious to know, too.

Jan 21 '24 12:01 matsuobasho

Yes! we are def going multimodal, I've tested yet but it could work already, that said I want to have a better DSL around it. Marking this as feature accepted

Jan 21 '24 19:01 joaomdmoura

Yes! we are def going multimodal, I've tested yet but it could work already, that said I want to have a better DSL around it. Marking this as feature accepted

Thanks @joaomdmoura look forward seeing this feature released soon.

Jan 22 '24 11:01 Ryan-ZL-Lin

crewAI crewAI copied to clipboard

MLLM Support

crewAI
crewAI copied to clipboard