DeepSeek-Coder-V2
What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?
Hey there,
I'm trying to fine-tune your model with FSDP. What value should I set for `fsdp_transformer_layer_cls_to_wrap`?
Thanks!