fastcomposer icon indicating copy to clipboard operation
fastcomposer copied to clipboard

The speed of saving models during multi machine and multi card training is very slow

Open JarvisFei opened this issue 2 years ago • 1 comments

Have you tried training this model on multiple machines?

If you have tried, is there anything special to pay attention to in terms of environment settings and parameters?

JarvisFei avatar Aug 03 '23 14:08 JarvisFei

Yes, we have. It's important to ensure a good inter-node connection to prevent communication bottlenecks during training.

Guangxuan-Xiao avatar Aug 10 '23 22:08 Guangxuan-Xiao