exo icon indicating copy to clipboard operation
exo copied to clipboard

Load distribution between nodes.

Open morlz opened this issue 1 year ago • 2 comments

I think it is worth sorting nodes by flops and executing more layers on those nodes where there is more performance. This will speed up execution in cases where the model size is smaller than the cluster memory. It is necessary to pass the size of the current model to PartitioningStrategy for this to work.

morlz avatar Feb 14 '25 13:02 morlz

I can try to implement this

morlz avatar Feb 14 '25 13:02 morlz

This will be the default in exo v2!

AlexCheema avatar Feb 14 '25 21:02 AlexCheema

A version of this is in EXO 1.0 - plan to improve this though.

rltakashige avatar Dec 18 '25 18:12 rltakashige

Please open again if this is still an issue!

rltakashige avatar Dec 18 '25 18:12 rltakashige