ColossalAI
ColossalAI copied to clipboard
[autoparallel] refactor metainfo forward memory calculation
What's New?
In this PR, I refactor the forward memory calculation of former patched operations. After this PR, in SPMD solver, we will use the conservative way to estimate the memory, i.e. always believing that shape consistency will create a new tensor for TP.