lorax icon indicating copy to clipboard operation
lorax copied to clipboard

vision language model support

Open 7uk3y opened this issue 1 year ago • 2 comments
trafficstars

Feature request

The developments in the robotics community around RT-2 show a lot of potential for VLMs but the hardware constraints for small developers makes it difficult to deploy RT-2 level performance with off the shelf hardware suitable for robotics. A potential path forward would be using very small models that have a library of loras to access when the environment calls for them. Lorax looks like it might be a way forward for smaller devs and robot enthusiasts to use and experiment with VLM architectures

Motivation

robots as a hobby

Your contribution

testing and refining available models

7uk3y avatar Jan 11 '24 20:01 7uk3y

Thanks for the filing this issue! I think this shouldn't be too hard to add support for some popular ones like LLaVA.

tgaddair avatar Jan 11 '24 22:01 tgaddair

Hey @tgaddair I would be up to give this a try, could you just give me a basic rundown of what you think needs to be done here to save me some head scratching

pbarker avatar Jan 12 '24 19:01 pbarker