Fix missing scale attributes for GPTJ

Open cmikeh2 opened this issue 1 year ago • 0 comments

This PR fixes two regressions introduced in the DeepSpeed chat release for GPT-J:

  1. Checks that each parameter has a `scale` attribute before accessing it.
  2. Changes the workspace offsets so that the same buffer is no longer used twice, which was overwriting data.
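
The two fixes can be illustrated with a minimal sketch. It is not the code from this PR: `collect_scales`, `plan_offsets`, and the example objects are hypothetical names chosen for illustration, and the real change lives in DeepSpeed's inference code. The sketch only shows the general patterns, namely guarding an optional attribute and laying out workspace regions so they do not overlap.

```python
from types import SimpleNamespace


def collect_scales(params):
    """Return each parameter's scale, or None when it has no `scale` attribute.

    Hypothetical helper: getattr with a default avoids the AttributeError
    that a bare `p.scale` raises on parameters without the attribute.
    """
    return [getattr(p, "scale", None) for p in params]


def plan_offsets(sizes):
    """Give each buffer its own start offset inside one shared workspace.

    Hypothetical helper: each offset starts where the previous region ends,
    so no two buffers share memory and none can overwrite another.
    """
    offsets = []
    cursor = 0
    for size in sizes:
        offsets.append(cursor)
        cursor += size
    return offsets, cursor


# Stand-ins for model parameters: one carries a scale, one does not.
with_scale = SimpleNamespace(scale=0.5)
without_scale = SimpleNamespace()
scales = collect_scales([with_scale, without_scale])

# Three buffers of 4, 8, and 2 elements placed back to back.
offsets, total = plan_offsets([4, 8, 2])
```

Here the second parameter simply yields `None` instead of raising, and each buffer starts at the end of the previous one.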

cmikeh2 · Apr 15 '23 18:04