mixtral-offloading
mixtral-offloading copied to clipboard
Support DeepSeek V2 model
DeepSeek V2 is a state-of-the-art moe model. Are there any plans to support this model?