keras-nlp
keras-nlp copied to clipboard
Sharding for model loading
As we are progressing towards welcoming Large models in library, we need a sharding mechanism for loading large checkpoints in model.
Hi @mattdangerw , Here is how huggingface shards the model. This is fairly simple strategy.
I think @nkovela1 will be taking a look here!