keras-nlp
keras-nlp copied to clipboard
Model weights contributions?
There is a model called SOLAR. This model follows the same architecture as LLaMA2, but it has more layers which make it outstanding performer better than Mistral and even Mixtral at some points (open LLM Leaderboard)
In this case, what could be the contribution points?