load linear layer weight with given dtype

Open polisettyvarma opened this issue 1 year ago • 3 comments

bf16 inference fails with a data type mismatch because half is the default dtype for the linear layer weight.

polisettyvarma avatar Jul 26 '23 06:07 polisettyvarma
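
A minimal sketch of the kind of mismatch described above, assuming a PyTorch linear layer whose weight is allocated with a default dtype of half. The `LinearLayer` class and its `weight_shape` and `dtype` parameters here are hypothetical and only illustrate the idea; this is not the actual DeepSpeed code or the diff in this change.

```python
import torch
import torch.nn.functional as F


class LinearLayer(torch.nn.Module):
    """Hypothetical layer for illustration; not DeepSpeed's implementation."""

    def __init__(self, weight_shape, dtype=torch.half):
        super().__init__()
        # Allocate the weight in the requested dtype instead of always using half.
        self.weight = torch.nn.Parameter(
            torch.zeros(weight_shape, dtype=dtype), requires_grad=False
        )

    def forward(self, x):
        return F.linear(x, self.weight)


x = torch.ones(2, 4, dtype=torch.bfloat16)

# With the default, the weight is half while the input is bf16: the dtypes differ.
default_layer = LinearLayer((3, 4))
print(default_layer.weight.dtype, x.dtype)

# Passing the inference dtype through keeps the weight and input consistent.
bf16_layer = LinearLayer((3, 4), dtype=torch.bfloat16)
out = bf16_layer(x)
print(out.dtype)
```

Threading the dtype through the constructor, rather than casting the weight after the fact, keeps the default behavior unchanged for existing half-precision callers.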

Please review this

polisettyvarma avatar Aug 08 '23 04:08 polisettyvarma

Hi @jeffra :) can you please help review this change?

nelyahu avatar Sep 21 '23 09:09 nelyahu

Hi @RezaYazdaniAminabadi @mrwyattii @jeffra @awan-10 @cmikeh2 @arashb please review this change.

polisettyvarma avatar Sep 30 '23 01:09 polisettyvarma

> Hi @RezaYazdaniAminabadi @mrwyattii @jeffra @awan-10 @cmikeh2 @arashb please review this change.

The changes look good to me 👍 I think we can merge it after fixing the comment from @tjruwase

RezaYazdaniAminabadi avatar Feb 02 '24 06:02 RezaYazdaniAminabadi