ColossalAI
[autoparallel] remove redundant comm nodes
This PR mainly does two things:
- Removes redundant comm nodes: if the sharding spec of a node's output is the same as the sharding spec required by its user node, no node is inserted between that pair of nodes.
- Adds size_per_elem_bytes to the resharding cost: previously the cost only counted the number of elements, so the element size was not taken into account.
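The two changes described above can be illustrated with a minimal, standalone sketch. It is not the code from this PR: the string-encoded sharding specs, the edge tuples, and the helper names `needs_comm_node`, `resharding_cost_bytes`, and `plan_comm_nodes` are all hypothetical, and multiplying the element count by `size_per_elem_bytes` is an assumption about how that value enters the cost.

```python
from math import prod
from typing import List, Tuple

# Hypothetical edge record: (producer, user, output_spec, required_spec, shape).
# Sharding specs are plain strings here (e.g. "S0R"), purely for illustration.
Edge = Tuple[str, str, str, str, Tuple[int, ...]]


def needs_comm_node(output_spec: str, required_spec: str) -> bool:
    """A comm node is only needed when the two specs differ."""
    return output_spec != required_spec


def resharding_cost_bytes(shape: Tuple[int, ...], size_per_elem_bytes: int) -> int:
    """Assumed cost model: number of elements scaled by the element size."""
    return prod(shape) * size_per_elem_bytes


def plan_comm_nodes(edges: List[Edge], size_per_elem_bytes: int) -> List[Tuple[str, str, int]]:
    """Return (producer, user, cost) only for edges that need resharding."""
    plan = []
    for producer, user, output_spec, required_spec, shape in edges:
        if not needs_comm_node(output_spec, required_spec):
            continue  # specs already match: insert nothing between the pair
        plan.append((producer, user, resharding_cost_bytes(shape, size_per_elem_bytes)))
    return plan


edges = [
    ("linear_1", "relu_1", "S0R", "S0R", (4, 8)),    # matching specs, skipped
    ("relu_1", "linear_2", "S0R", "RS1", (4, 8)),    # differing specs, resharded
]
fp16_plan = plan_comm_nodes(edges, size_per_elem_bytes=2)
fp32_plan = plan_comm_nodes(edges, size_per_elem_bytes=4)
```

In this sketch the first edge produces no comm node because its specs match, and the cost of the second edge doubles when the element size goes from 2 to 4 bytes, which a pure element count would not capture.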