unsloth icon indicating copy to clipboard operation
unsloth copied to clipboard

[WIP] add support for mixtral

Open tohrnii opened this issue 1 year ago • 9 comments

Mixtral WIP

tohrnii avatar Jan 30 '24 13:01 tohrnii

Fantastic and fabulous work @tohrnii!!! Super appreciate it! I will take a look later today!

danielhanchen avatar Feb 02 '24 03:02 danielhanchen

Any update about this pull request?

kaykyr avatar Feb 12 '24 13:02 kaykyr

@kaykyr @danielhanchen @tohrnii You guys open to some collaboration on this? I think I just my Phi2 implementation done (big touch wood) so I'm happy to take a look

cm2435 avatar Feb 12 '24 22:02 cm2435

Apologies, I got stuck on something else. I'd love to collaborate @cm2435. If however you are close to completing the implementation, I'm happy to close this PR in favor of yours.

tohrnii avatar Feb 13 '24 03:02 tohrnii

Hey - thanks again on the PR @tohrnii and super appreciate it again :) Ye more than happy to make this happen and collab with you all @kaykyr @cm2435 - I was just a bit bogged down recently on chat templates and making a UI - I will be much more free next week, then we can make Mixtral happen :)

danielhanchen avatar Feb 13 '24 07:02 danielhanchen

For sure! I am trying to fine tune a MoE pretrained if I had progress I will create pull requests guys. I also able to offer my small server (2x RTX 3090 with NVLink + i9 11900HK + 64GB DDR4) for collaborators who wanna run tests with multi-gpu.

kaykyr avatar Feb 14 '24 13:02 kaykyr

@kaykyr Oh thanks for the kind offer!!! I'll take up for that offer later in the month :)

danielhanchen avatar Feb 15 '24 07:02 danielhanchen

@kaykyr funny you mention that- I've got almost the exact same setup! I'm going to be very sad when they deprecate the SLI bridge as cuda supported hardware

cm2435 avatar Feb 16 '24 20:02 cm2435

Great work. Is there any estimate about when this will be merged?

ilkersigirci avatar Mar 08 '24 10:03 ilkersigirci