DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

Add BigCode models support

Open cupertank opened this issue 2 years ago • 2 comments

This PR adds support for BigCode models. As you can see in https://github.com/microsoft/DeepSpeed/issues/3811, it's a pretty popular architecture

If you have any questions, please feel free to ask.

cupertank avatar Oct 20 '23 12:10 cupertank

Also, I don't know how to add tests for these models, if someone could help me out with that, I would be very grateful.

cupertank avatar Oct 20 '23 12:10 cupertank

@cupertank - is this still a PR you'd like to see completed?

loadams avatar May 24 '24 17:05 loadams

Closing this PR as stale.

loadams avatar Jan 29 '25 19:01 loadams