DeepSpeedExamples zero3 and enable hybrid engine are not suitable for llama2, how to solve it?


Open terence1023 opened this issue 1 year ago • 3 comments

In my experiments, I found that with zero3 and the hybrid engine enabled, the Actor generates repeated tokens or nothing at all during stage 3 (PPO) training. Here is an example:

I also ran some other experiments:

Experiment 1: zero2 with the hybrid engine enabled, which works fine.

Experiment 2: zero3 with the hybrid engine disabled, which works fine.

Experiment 3: the OPT model with zero3 and the hybrid engine enabled, which works fine.
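For anyone trying to reproduce this, the combinations above can be written down as a small matrix. This is only a sketch: `make_actor_config` and `EXPERIMENTS` are hypothetical names, the dict only mirrors the `zero_optimization` and `hybrid_engine` sections of a DeepSpeed config, and Experiments 1 and 2 are assumed to use llama2 since the report does not say otherwise.

```python
# Hypothetical helper (not part of DeepSpeed or DeepSpeed-Chat): builds a
# minimal dict shaped like the relevant sections of a DeepSpeed config.
def make_actor_config(zero_stage: int, hybrid_engine: bool) -> dict:
    return {
        "zero_optimization": {"stage": zero_stage},
        "hybrid_engine": {"enabled": hybrid_engine},
    }


# (model, ZeRO stage, hybrid engine enabled, outcome as reported in this issue).
# Assumption: the first three rows use llama2.
EXPERIMENTS = [
    ("llama2", 3, True, "repeated tokens or empty output"),
    ("llama2", 2, True, "fine"),
    ("llama2", 3, False, "fine"),
    ("OPT", 3, True, "fine"),
]

for model, stage, hybrid, outcome in EXPERIMENTS:
    cfg = make_actor_config(stage, hybrid)
    print(f"{model}: {cfg} -> {outcome}")
```

Only the llama2 row with stage 3 and the hybrid engine enabled fails, which is why the combination of those two settings with llama2 looks like the trigger.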

I'm confused by this behavior and would like to know how to fix it. Because of time and GPU memory constraints, I need to use zero3 with the hybrid engine enabled.

Feb 26 '24 14:02 terence1023

@arashb @ShadenSmith @jeffra @selfReference Could you please help look into this problem? Thanks!

Feb 27 '24 02:02 terence1023

Hi, @terence1023. I am facing the same issue. Could you please let me know if you have found a solution to this issue? Any guidance or suggestions you could provide would be greatly appreciated.

Thank you for your time and assistance.

May 28 '24 06:05 luhengtong

Has this been resolved?

Aug 20 '24 10:08 ouyanmei
