Olatunji Ruwase comments

Results: 612 comments by Olatunji Ruwase

some fix in flops_profiler

@lucasleesw, thanks for the PR. Can you please use [this](https://github.com/microsoft/DeepSpeed/blob/master/CONTRIBUTING.md#prerequisites) to resolve the formatting issues?

DeepSpeed initialization with GNN-like model

@buttercutter, your command is missing a [DeepSpeed config file](https://www.deepspeed.ai/getting-started/#launching-deepspeed-training), which is passed via `--deepspeed_config`. Alternatively, you can pass a dict as [config_params](https://deepspeed.readthedocs.io/en/latest/initialize.html#training-initialization) to `deepspeed.initialize()`.
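The two options can be sketched as follows. This is a minimal illustration, not the configuration from this issue: the config values, the `ds_config.json` file name, and the helper names `write_config` and `init_engine` are all assumptions made for the example. `init_engine` needs DeepSpeed installed and a real `torch.nn.Module`, so it is defined but not called here.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical minimal configuration; the values are placeholders,
# not recommendations for any particular model.
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "fp16": {"enabled": False},
}


def write_config(config: dict, path: Path) -> Path:
    """Option 1: write the config as JSON, to be passed via --deepspeed_config."""
    path.write_text(json.dumps(config, indent=2))
    return path


def init_engine(model, config: dict):
    """Option 2: pass the dict directly instead of a file.

    Requires DeepSpeed to be installed; imported lazily so the rest of
    this sketch runs without it.
    """
    import deepspeed

    engine, optimizer, _, _ = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        config_params=config,
    )
    return engine, optimizer


# Hypothetical file location, chosen only for this example.
config_path = write_config(ds_config, Path(tempfile.gettempdir()) / "ds_config.json")
```

With the file option, the path written above is what would follow `--deepspeed_config` on the launch command.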

DeepSpeed initialization with GNN-like model

DeepSpeed configuration is meant to be network-agnostic, so that configuration file should work as is, except for the `auto` fields, which are defined only for the HF frontend. The configuration file is...

DeepSpeed initialization with GNN-like model

Set `"train_micro_batch_size_per_gpu"` to 8 in the configuration file.
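For context, DeepSpeed relates the micro-batch size to the global batch size as `train_batch_size = train_micro_batch_size_per_gpu * gradient_accumulation_steps * number of GPUs`. A small sketch of that arithmetic, where the accumulation steps and GPU count are assumed values rather than details from this issue:

```python
def effective_batch_size(micro_batch_per_gpu: int,
                         grad_accum_steps: int,
                         world_size: int) -> int:
    """Global batch size implied by the per-GPU micro-batch setting."""
    return micro_batch_per_gpu * grad_accum_steps * world_size


# Micro-batch of 8 as suggested above; 1 accumulation step and 4 GPUs
# are hypothetical values chosen for illustration.
total = effective_batch_size(micro_batch_per_gpu=8, grad_accum_steps=1, world_size=4)
```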

DeepSpeed initialization with GNN-like model

It should be, but please report any issues.

DeepSpeed initialization with GNN-like model

Sorry, it appears #1149 was never merged. Unfortunately, it has a conflict with master. Can you please try picking that up?

DeepSpeed initialization with GNN-like model

@buttercutter, #1149 is now merged. Please try master.

[BUG] DeBERTa has bad performance when using ZERO Stage-3 with continuous warnings "A module has unknown inputs or outputs type"

@pacman100, we made some recent changes to ZeRO 3, so I wonder if those are the source of the regression. Could you please try v0.6.0?

[BUG] DeBERTa has bad performance when using ZERO Stage-3 with continuous warnings "A module has unknown inputs or outputs type"

@pacman100, thanks for the update. So, you see a totally different error with the old release :(. Let's ignore the old release then. I will work on setting up a repro...

[BUG] DeBERTa has bad performance when using ZERO Stage-3 with continuous warnings "A module has unknown inputs or outputs type"

@pacman100, thanks for sharing your update. I am glad that the performance problem is resolved in the latest code. I have created #1974 to suppress the warning noise. The PR...