Yongkang Chen

Results 4 issues of Yongkang Chen

Does the framework support the mistral 87B model perfectly? I encountered an Out of Memory error during use. The machine is 8*A100 80G. ![image](https://github.com/microsoft/DeepSpeed-MII/assets/81293778/47b18e0e-6b84-4594-b0bf-8b546e4a63ac)