DeepSpeedExamples
Example models using DeepSpeed
Bump joblib from 0.16.0 to 1.2.0 in /MoQ/huggingface-transformers/examples/research_projects/lxmert
Bumps [joblib](https://github.com/joblib/joblib) from 0.16.0 to 1.2.0. Changelog, sourced from joblib's changelog: Release 1.2.0 fixes a security issue where `eval(pre_dispatch)` could potentially run arbitrary code; now only basic numerics are supported....
Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5. Release notes, sourced from urllib3's releases: 1.26.5 :warning: IMPORTANT: urllib3 v2.0 will drop support for Python 2 (read more in the v2.0 Roadmap). Fixed...
Bump numpy from 1.19.2 to 1.22.0 in /MoQ/huggingface-transformers/examples/research_projects/lxmert
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.22.0. Release notes, sourced from numpy's releases: NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
Bumps [notebook](http://jupyter.org) from 6.1.5 to 6.4.12.
Bumps [pytorch-lightning](https://github.com/PyTorchLightning/pytorch-lightning) from 1.0.4 to 1.6.0. Release notes, sourced from pytorch-lightning's releases: PyTorch Lightning 1.6: Support Intel's Habana Accelerator, New efficient DDP strategy (Bagua), Manual Fault-tolerance, Stability and Reliability. The...
Hello, I have successfully run through the three stages, but I had to make some cuts to the batch size / LoRA training. I don't have a good baseline on...
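One common way to keep results comparable after shrinking the per-GPU batch is to raise gradient accumulation so the effective batch size stays the same. A minimal sketch, assuming the standard DeepSpeed config keys; all numbers are placeholders, not the settings used above:

```python
# Sketch: keep the effective batch size constant while shrinking the per-GPU batch.
# All values below are placeholders for illustration.
world_size = 8                      # number of GPUs
micro_batch_per_gpu = 2             # reduced to fit memory
gradient_accumulation_steps = 8     # raised to compensate

ds_config = {
    "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
    "gradient_accumulation_steps": gradient_accumulation_steps,
    # Effective (global) batch size seen by the optimizer:
    "train_batch_size": micro_batch_per_gpu * gradient_accumulation_steps * world_size,
}

print(ds_config["train_batch_size"])  # 128 in this example
```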
A recent branch of peft is about to support multiple LoRA adapters. This implementation seems very well suited to training in the PPO stage. An SFT model can be used as...
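A rough sketch of what multiple LoRA adapters on one frozen base model can look like with peft's multi-adapter API; the model name, adapter names, and LoRA hyperparameters below are assumptions for illustration, not the branch referred to above:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Hypothetical base model, chosen only for illustration.
base = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")

lora_cfg = LoraConfig(
    r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"
)

# First adapter, e.g. the one trained during SFT.
model = get_peft_model(base, lora_cfg, adapter_name="sft")

# Second adapter sharing the same frozen base, e.g. for the PPO actor updates.
model.add_adapter("ppo", lora_cfg)

# Switch which adapter is active before a forward pass.
model.set_adapter("ppo")
```

Because both adapters share a single copy of the base weights, the SFT behaviour and the actor being updated can live in one model instance, which is presumably what makes this attractive for the PPO stage.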
I found that the memory usage is very high even when using ZeRO-3 and LoRA, so I was wondering whether pipeline parallelism or tensor parallelism can be supported?
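Short of pipeline or tensor parallelism, memory can often be reduced further by offloading ZeRO-3 parameter and optimizer states to CPU. A minimal sketch of the relevant DeepSpeed config section, with placeholder values:

```python
# Illustrative ZeRO-3 offload settings; batch size and threshold are placeholders.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "zero_optimization": {
        "stage": 3,
        "offload_param": {"device": "cpu", "pin_memory": True},
        "offload_optimizer": {"device": "cpu", "pin_memory": True},
        # Parameters smaller than this stay resident on each GPU
        # to avoid gathering many tiny tensors.
        "stage3_param_persistence_threshold": 10000,
    },
}
```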
When I run the step1_supervised_finetuning script, I find that the memory usage of ZeRO-3 is higher than that of ZeRO-1, which seems unreasonable. Is there some other optimization at play here?
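One way to sanity-check the expected per-GPU footprint of model states under ZeRO-3 is DeepSpeed's built-in estimator. A minimal sketch, where the model name is only an example and the GPU counts are placeholders:

```python
from transformers import AutoModelForCausalLM
from deepspeed.runtime.zero.stage3 import estimate_zero3_model_states_mem_needs_all_live

# Hypothetical model, used only to make the estimate concrete.
model = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")

# Prints the estimated per-GPU and per-CPU memory needed for parameters,
# gradients, and optimizer states under ZeRO-3, with and without offload.
estimate_zero3_model_states_mem_needs_all_live(model, num_gpus_per_node=8, num_nodes=1)
```

The estimate covers model states only; activations, temporarily gathered parameters, and allocator fragmentation are not included, which can account for gaps between the estimate and what is actually observed.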