zero-bubble-pipeline-parallelism
More general ZBV scheduling
Currently, ZBV scheduling is limited to configurations where
(number_of_layers / number_of_stage) is an even number.
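
To make the limitation concrete, here is a minimal sketch of the check in Python. The function name `satisfies_zbv_limitation` is hypothetical and not part of the repository, and the sketch additionally assumes that the layers must divide evenly across the stages.

```python
def satisfies_zbv_limitation(number_of_layers: int, number_of_stage: int) -> bool:
    """Hypothetical helper: True if (number_of_layers / number_of_stage) is even.

    Assumption (not stated in the issue): the layers must also divide
    evenly across the stages for the quotient to be meaningful.
    """
    if number_of_stage <= 0:
        raise ValueError("number_of_stage must be positive")
    if number_of_layers % number_of_stage != 0:
        return False  # layers do not split evenly across stages
    layers_per_stage = number_of_layers // number_of_stage
    return layers_per_stage % 2 == 0


if __name__ == "__main__":
    for layers, stages in [(32, 4), (24, 8), (30, 4)]:
        print(layers, stages, satisfies_zbv_limitation(layers, stages))
```

For example, 32 layers on 4 stages gives 8 layers per stage and passes, while 24 layers on 8 stages gives 3 layers per stage and does not. A more general ZBV schedule would lift this restriction.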