Max Zvyagin

Results 1 comments of Max Zvyagin

Hi Akarsh, thanks for checking this out! I guess my question is partially what the reason is for calling `enable_transformers_pretrained_deepspeed_sharding(self)` in the setup() function vs in the constructor, and if...