Medusa icon indicating copy to clipboard operation
Medusa copied to clipboard

[New feature] llama.cpp support

Open ctlllll opened this issue 1 year ago • 7 comments

We are currently running out of bandwidth. Contributors to help integrate Medusa into llama.cpp would be greatly appreciated :)

ctlllll avatar Sep 18 '23 15:09 ctlllll

What type of skills would be needed? Any specific grunt work in mind? My skillset is probably too narrow if it involves any heavier math but if you can lay it out I might be able to attempt something

kalomaze avatar Sep 18 '23 16:09 kalomaze

It looks like a lot of the groundwork is being laid out here with the parallel decoding implementation: https://github.com/ggerganov/llama.cpp/pull/3228

kalomaze avatar Sep 18 '23 20:09 kalomaze

It looks like a lot of the groundwork is being laid out here with the parallel decoding implementation: ggerganov/llama.cpp#3228

Yeah, that's also what I thought. The tree attention implementation in llama.cpp should be very helpful for integrating Medusa. And I guess the main challenge is the familiarity with their codebase.

ctlllll avatar Sep 18 '23 23:09 ctlllll

We are almost done in supporting medusa in llama.cpp. now working on attention mask part and few other things.

JianbangZ avatar Sep 26 '23 14:09 JianbangZ

I would like to help with finalizing the support for this. Is there any place where I can contact the group behind this project and ask questions?

kalomaze avatar Oct 07 '23 22:10 kalomaze

I would like to help with finalizing the support for this. Is there any place where I can contact the group behind this project and ask questions?

Hi @kalomaze , that would be great, just sent a friend request on Discord :) (I assumed your user name is kalomaze right?)

ctlllll avatar Oct 09 '23 01:10 ctlllll

I would like to help with finalizing the support for this. Is there any place where I can contact the group behind this project and ask questions?

Hi @kalomaze , that would be great, just sent a friend request on Discord :) (I assumed your user name is kalomaze right?)

Sorry, what was your tag? I'm coming back to this now, I figured GitHub would have sent a notification.

kalomaze avatar Oct 28 '23 16:10 kalomaze