VILA
VILA copied to clipboard
LongVILA - compatibility with other LLMs
Hi! very impressed by your work with LongVILA. I would like to do long context with Qwen/LLaMA3.1, but currently, i only see support for Mixtral.
Any chance you plan on releaseing long context support for these models soon?
Best, Orr
Looking at the huggingface release -- it appears that the base LLM is LLaMA3.1? If yes -- could you please advise how this is implemented? I want to use this on my own model for long-context support, is this some attention implementation add-on?
LongVILA models and docs have been released. Close for now. Feel free to reopen if further questions.