xla
xla copied to clipboard
How to use spmd to support hybrid shard data parallelism?
❓ Questions and Help
Fsdp can be well expressed by spmd, but hsdp seems to be unable to be expressed. Is there any way to express hsdp in spmd?