Chien-Chin Huang

Results 28 issues of Chien-Chin Huang

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #92184 Current design of FSDP only support NamedOptimizer/KeyedOptimizer when use_orig_params is True this PR adds the support even if use_orig_params if False....

release notes: distributed (fsdp)

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #91343

ciflow/trunk
release notes: distributed (fsdp)

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #92118 Make optim_state_dict and optim_state_dict_to_load public APIs and consolidate them with state_dict by using the same state_dict_type to decide how to perform...

ciflow/trunk
release notes: distributed (fsdp)

Summary: Print out more useful error message for optim_state_dict Test Plan: CI Reviewed By: wz337 Differential Revision: D43556073

fb-exported
ciflow/trunk
release notes: distributed (fsdp)

This PR uses shared memory to do async checkpoint on another process and also implements async staging (overlapping staging with the next iteration).

CLA Signed

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #125339 * #125338 * #125337 * #125336 * __->__ #125335 * #125334 * #125333 Summary: Right now DCP only unflatten a container if...

oncall: distributed
ciflow/trunk
ciflow/periodic
module: distributed_checkpoint

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #125339 * __->__ #125338 * #125337 * #125336 * #125335 * #125334 * #125333 Summary: This is useful if users would like to...

oncall: distributed
ciflow/trunk
ciflow/periodic
module: distributed_checkpoint

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #125339 * #125338 * #125337 * __->__ #125336 * #125335 * #125334 * #125501 Summary: distributed_state_dict should not try to use `getattr` to...

oncall: distributed
ciflow/trunk
ciflow/periodic
module: distributed_checkpoint

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #125339 * #125338 * #125337 * #125336 * #125335 * #125334 * #125333 Summary: This is useful if users would like to...

oncall: distributed
ciflow/trunk
ciflow/periodic
module: distributed_checkpoint

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #125339 * #125338 * #125337 * #125336 * #125335 * __->__ #125334 * #125333 Summary: If an object only exists on certain non-coordinator...

oncall: distributed
ciflow/trunk
ciflow/periodic
module: distributed_checkpoint