Sourab Mangrulkar

244 comments by Sourab Mangrulkar

Hello Stas, Thank you for the information. I observe it with the Trainer too. Steps to reproduce the behaviour with the Trainer: 1. Official `run_glue.py` [script](https://github.com/huggingface/transformers/blob/main/examples/pytorch/text-classification/run_glue.py) with the following change. The change...

> Also you're running pt-nightly - I wonder if this is something new in pytorch? Does it work with pt-1.11?

Yes, this is on pt-nightly. However, I believe it has...

Hello @tjruwase, Getting the below error with v0.6.0:

```
Traceback (most recent call last):
  File "/home/sourab/deepspeed-test/src/text-classification/run_glue_no_trainer.py", line 619, in <module>
    main()
  File "/home/sourab/deepspeed-test/src/text-classification/run_glue_no_trainer.py", line 511, in main
    accelerator.backward(loss)
  File "/home/sourab/accelerate/src/accelerate/accelerator.py", line 616, ...
```

Hello @tjruwase, I tried rerunning with the latest release in both multi-GPU and single-GPU setups. I no longer observe the accuracy issue (the above might have used a different DeBERTa pretrained checkpoint...

Hello @tjruwase, Thank you for the fix 😄! Yes, the above PR is working as expected to suppress the warnings.

Hello @shrinath-suresh, this issue has to be fixed on the PyTorch side. The issue raised with PyTorch has been linked above.

Also, when using `auto_wrap`, please specify either `--fsdp_transformer_layer_cls_to_wrap` or `--fsdp_min_num_params` as part of the command-line arguments. This is what enables sharding of parameters, gradients and optimizer state across GPUs...
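
As a rough illustration of what such a launch could look like with the Trainer's FSDP integration, here is a minimal sketch; the process count, script, model, task, output directory and layer-class name are placeholders, and `--fsdp "full_shard auto_wrap"` is assumed to be the flag that enables auto wrapping:

```
# Hypothetical command; only one of the two wrap-policy flags is needed.
torchrun --nproc_per_node 8 run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name mrpc \
  --do_train \
  --output_dir /tmp/mrpc-fsdp \
  --fsdp "full_shard auto_wrap" \
  --fsdp_transformer_layer_cls_to_wrap BertLayer
# Alternatively, for size-based wrapping, replace the last flag with
#   --fsdp_min_num_params 100000000
```

Without one of these two flags, `auto_wrap` has no policy for deciding which submodules to wrap, so the parameters, gradients and optimizer state are not actually sharded across GPUs.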

> There is a bit too much in this PR to wrap my head around. Can we split it between multiGPU launcher fixes, DeepSpeed launcher fixes and other fixes? Thanks!...

Hello @Aaryan369, when using the `standard` launcher, I hope you are launching the script on both nodes as you would typically do when using `torchrun` in the multinode setting. I...
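
To make "launching the script on both nodes" concrete, here is a rough sketch of a two-node `torchrun` invocation; the IP address, port, process counts and script name are placeholders, not values from the original thread:

```
# On node 0 (which also hosts the rendezvous):
torchrun --nnodes 2 --nproc_per_node 8 --node_rank 0 \
  --master_addr 10.0.0.1 --master_port 29500 run_glue_no_trainer.py

# On node 1, run the same command and change only the rank:
torchrun --nnodes 2 --nproc_per_node 8 --node_rank 1 \
  --master_addr 10.0.0.1 --master_port 29500 run_glue_no_trainer.py
```

The same pattern applies with the `standard` launcher: every node has to start its own copy of the command, otherwise the job will typically hang waiting for the missing ranks.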