torchtune icon indicating copy to clipboard operation
torchtune copied to clipboard

Support for Gemma 7B

Open 1Krypt0 opened this issue 1 year ago • 2 comments

Hi!

Are you considering adding support for Gemma 7B? It seems that it would be a great addition to the set of available models.

1Krypt0 avatar May 13 '24 08:05 1Krypt0

We're definitely interested in adding more models!

As I understand it, the 2B and 7B architectures are roughly the same (just different sizes for the parameters). If you'd be interested in adding the model yourself, we'd gladly stamp it :) It should be fairly simple, just adding:

  1. A new gemma_7b function in the gemma/_model_builders.py file with the appropriate sizes
  2. A screenshot or attached W&B log showing that the model learns from a simple alpaca fine-tuning
  3. Updating our models docs to show we now support it!

LMK if you have any questions or if this is something you'd like to take on.

joecummings avatar May 13 '24 14:05 joecummings

I am on it!

Optimox avatar May 13 '24 15:05 Optimox

I only now noticed that I didn't close this after merging. Thanks guys! You're doing a great job with this project!

1Krypt0 avatar Jul 07 '24 16:07 1Krypt0