keras-nlp Add StableLM-3B 4E1T to Keras Hub

Add StableLM-3B 4E1T to Keras Hub

Open Bond099 opened this issue 8 months ago • 5 comments

This PR adds the StableLM-3B 4E1T model to Keras Hub. However, numerical matching with the Hugging Face implementation is still in progress.

Mar 18 '25 18:03 Bond099

@divyashreepathihalli Here is a comparison of numerics with Hugging Face in Colab. The results match with an absolute tolerance of 1e-3, but they do not match when using 1e-5. Could you please take a look and suggest some improvements or explanations for this discrepancy?

Mar 22 '25 18:03 Bond099

The numerics is good enough!

Apr 16 '25 05:04 divyashreepathihalli

@Bond099 let's sync this with the latest changes and make sure to run our format script. I'm not exactly sure why non of our CI is running, but I don't think it ran.