unsloth icon indicating copy to clipboard operation
unsloth copied to clipboard

Phi-3 small (7B) and medium (14B)

Open rwl4 opened this issue 1 year ago • 8 comments

It would be great to see these models work!

NotImplementedError: Unsloth: /srv/models/Phi-3-medium-4k-instruct not supported yet! Make an issue to https://github.com/unslothai/unsloth!

Done. :)

rwl4 avatar May 21 '24 17:05 rwl4

We're working on it! Phi 3 medium support will most likely come out first and then small due to differing architecture.

shimmyshimmer avatar May 21 '24 17:05 shimmyshimmer

@rwl4 Currrently we support phi-3 mini via https://colab.research.google.com/drive/1NvkBmkHfucGO3Ve9s1NKZvMNlw5p83ym?usp=sharing and https://huggingface.co/unsloth/Phi-3-mini-4k-instruct-bnb-4bit

danielhanchen avatar May 21 '24 20:05 danielhanchen

any updates on this?

joshib123 avatar May 23 '24 07:05 joshib123

waiting for it! So thanks. :smile:

JackCloudman avatar May 23 '24 18:05 JackCloudman

It's out!! @JackCloudman @joshib123 @rwl4

https://x.com/danielhanchen/status/1793762458437578955

Phi 3 medium and mini. Small will be supported later

shimmyshimmer avatar May 24 '24 08:05 shimmyshimmer

@rwl4 @JackCloudman @joshib123 We support Phi-3 Medium and Mini now! See https://github.com/unslothai/unsloth/releases/tag/May-2024 (also includes Colabs)

Small is still in the works!

Please update Unsloth for local machines. For Colab or Kaggle just refresh and restart the env!

pip uninstall unsloth -y
pip install --upgrade --force-reinstall --no-cache-dir git+https://github.com/unslothai/unsloth.git

danielhanchen avatar May 24 '24 10:05 danielhanchen

Hi @danielhanchen, thanks for the update. But, when I tried to finetune the Phi-3-medium the training loss goes from 1.80 to 0 after first step. Wondering if there is a bug somewhere in the code? PS: the same code worked for other models (such as Llama-3-8b).

joshib123 avatar May 27 '24 14:05 joshib123

@joshib123 I don't think there's a bug - that probably means ur learning rate is too high

danielhanchen avatar May 27 '24 17:05 danielhanchen

@danielhanchen thanks for the impressive work! Any news on phi3 small?

anakin87 avatar Jun 07 '24 00:06 anakin87

@anakin87 No sorry - Small is a vastly different architecture :(

danielhanchen avatar Jun 09 '24 14:06 danielhanchen