gc
Results
1
comments of
gc
Thank you for the quick reply. I see your point. In my config, I assume the CP group share the same batch of data. So each dp_shard rank takes 1/8...