Yanli Zhao comments

Results: 5 comments of Yanli Zhao

Using FSDP

For the native FSDP version, feel free to use `transformer_auto_wrap_policy` to wrap your model; also try the new `MixedPrecision` config for bfloat16 :)
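
A minimal sketch of what that suggestion could look like with native PyTorch FSDP. The layer class `nn.TransformerEncoderLayer` and the helper `fsdp_kwargs` are assumptions for illustration only; substitute the transformer block class of your own model. Setting all three dtypes to bfloat16 is one possible choice, not a recommendation from the comment.

```python
import functools

import torch
import torch.nn as nn
from torch.distributed.fsdp import MixedPrecision
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy

# Wrap every transformer block as its own FSDP unit.
# Assumption: the model is built from nn.TransformerEncoderLayer blocks.
auto_wrap_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={nn.TransformerEncoderLayer},
)

# Keep parameters, gradient reduction, and buffers in bfloat16.
bf16_policy = MixedPrecision(
    param_dtype=torch.bfloat16,
    reduce_dtype=torch.bfloat16,
    buffer_dtype=torch.bfloat16,
)


def fsdp_kwargs():
    """Hypothetical helper: keyword arguments to pass to the FSDP constructor."""
    return {"auto_wrap_policy": auto_wrap_policy, "mixed_precision": bf16_policy}
```

Once a process group is initialized and the model is on the right device, these would be passed as `FullyShardedDataParallel(model, **fsdp_kwargs())`.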

Using FSDP

@SeanNaren thanks for trying PTD FSDP! 1. Would you please `print(model)` after constructing the whole model? We found some bugs in Lightning; it seems the outermost model is not wrapped, it...

Pytorch profiler does not support Distributed view for FSDP training

@rohan-varma do you know who can help with this?

[FSDP][RFC] Enforce rank `r`'s current device is `cuda:r`

I think the SPSD and CUDA-device-only assumption is the current FSDP state, "we can provide earlier and cleaner error handling in the case the user forgets to set...

NCCL Backend does not support ComplexFloat data type

> @ezyang Sorry, I assume you mean use `torch.view_as_real`, but I'm unsure how to modify the above DDP example to use it, or do you mean for a custom distributed...
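
As a rough illustration of the `torch.view_as_real` idea in the quote, the sketch below reinterprets complex tensors as real tensors with a trailing dimension of size 2, sums them the way a sum all-reduce would, and converts the result back. The two "rank" tensors are made-up values and the reduction is simulated in a single process; no NCCL call is made here.

```python
import torch

# Assumption: these stand in for the complex tensor held by each of two ranks.
rank_tensors = [
    torch.tensor([1 + 2j, 3 - 1j]),
    torch.tensor([0.5 - 2j, 1 + 1j]),
]

# view_as_real exposes (real, imag) pairs as a float tensor of shape (..., 2),
# a dtype that real-valued collectives can handle.
real_sum = torch.zeros(2, 2)
for z in rank_tensors:
    real_sum += torch.view_as_real(z)

# Convert the reduced real tensor back to complex.
reduced = torch.view_as_complex(real_sum)
```

In an actual distributed run, the same pattern would presumably be `dist.all_reduce(torch.view_as_real(z))`; because `view_as_real` returns a view, the in-place reduction should update `z` itself.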