jetstream-pytorch
jetstream-pytorch copied to clipboard
Add deepseek distils as options
Small change that allows directly using the recently released DeepSeek R1 Distils.
Tested on TPU v4-8 for "deepseek-ai/DeepSeek-R1-Distill-Llama-8B" and it worked.