kineto
PyTorch Profiler does not support the Distributed view for FSDP training
FSDP (released with PyTorch 1.11) is a form of distributed data-parallel training, so I expected the PyTorch Profiler to show the Distributed view when running FSDP training. That view is currently missing from the profiler output for FSDP.
The following snapshots compare the Profiler view for FSDP and for DDP.
FSDP training
DDP training.
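For anyone trying to reproduce this, here is a minimal sketch of how the traces could be collected. The original report does not give the profiler configuration, so the schedule values and the helper name `profile_training` are assumptions, and the toy `nn.Linear` model stands in for a model wrapped in FSDP or DDP.

```python
import os
import tempfile

import torch
import torch.nn as nn
from torch.profiler import (
    ProfilerActivity,
    profile,
    schedule,
    tensorboard_trace_handler,
)


def profile_training(model: nn.Module, logdir: str, steps: int = 5) -> list:
    """Hypothetical helper: run a few training steps under the profiler
    and return the names of the trace files written to `logdir`."""
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    device = next(model.parameters()).device

    # Always record CPU activity; add CUDA only when a GPU is present.
    activities = [ProfilerActivity.CPU]
    if torch.cuda.is_available():
        activities.append(ProfilerActivity.CUDA)

    with profile(
        activities=activities,
        # Assumed schedule: skip 1 step, warm up 1 step, record 2 steps.
        schedule=schedule(wait=1, warmup=1, active=2, repeat=1),
        # Writes a *.pt.trace.json file that the TensorBoard plugin reads.
        on_trace_ready=tensorboard_trace_handler(logdir),
    ) as prof:
        for _ in range(steps):
            inputs = torch.randn(8, 16, device=device)
            loss = model(inputs).sum()
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
            prof.step()  # tell the profiler that one step has finished

    return sorted(os.listdir(logdir))


if __name__ == "__main__":
    # Stand-in model; in the real comparison this would be the same
    # network wrapped once in DDP and once in FSDP.
    print(profile_training(nn.Linear(16, 4), tempfile.mkdtemp()))
```

Running the same helper on the DDP-wrapped and the FSDP-wrapped model, then opening each log directory in TensorBoard, should make it possible to compare which views appear for each run.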
Feature request
Add support for FSDP in the Distributed view of the Profiler.
cc @zhaojuanmao , @mrshenli
@rohan-varma do you know who can help with this?