[python] fix: Fix BLOOM's embedding mapping for deepspeed chat

Open hyunwoongko opened this issue 2 years ago • 0 comments

What does this PR do?

This PR makes BLOOM model trained on DeepSpeed Chat can be parallelized. DeepSpeed Chat saves checkpoint like "transformer.word_embedding.weight". so I got an error in this part.

Fixes # (issue) #399

Before submitting

[x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[x] Did you read the contributor guideline, Pull Request section?
[x] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
[ ] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
[] Did you write any new necessary tests?

Who can review?

@OlivierDehaene

Jun 03 '23 03:06 hyunwoongko

text-generation-inference text-generation-inference copied to clipboard

[python] fix: Fix BLOOM's embedding mapping for deepspeed chat

What does this PR do?

Before submitting

Who can review?

text-generation-inference
text-generation-inference copied to clipboard