DeepSpeedExamples

ZeRO-3 with the hybrid engine enabled does not work for Llama 2. How can this be solved?

Open terence1023 opened this issue 1 year ago • 3 comments

In my experiments, I found that when I use ZeRO-3 with the hybrid engine enabled, the actor generates repeated tokens or nothing at all during stage 3 (PPO) training. Here is an example:

image

I also ran some other experiments:

Experiment 1: ZeRO-2 with the hybrid engine enabled, which works fine.

Experiment 2: ZeRO-3 with the hybrid engine disabled, which works fine.

Experiment 3: the OPT model with ZeRO-3 and the hybrid engine enabled, which works fine.
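For readers trying to reproduce this, the combinations above can be summarized in a small sketch. It is only an illustration, not the actual training configuration: the helper names `ds_config_fragment` and `failing_cases` are hypothetical, the fragment contains only the two settings that were varied, and it assumes Experiments 1 and 2 were run with Llama 2 (the post does not state the model explicitly). The keys `zero_optimization.stage` and `hybrid_engine.enabled` are assumed to follow the usual DeepSpeed config layout.

```python
def ds_config_fragment(zero_stage: int, enable_hybrid_engine: bool) -> dict:
    """Return only the two settings varied in the experiments (not a full config)."""
    return {
        "zero_optimization": {"stage": zero_stage},
        "hybrid_engine": {"enabled": enable_hybrid_engine},
    }


# (model, ZeRO stage, hybrid engine enabled) -> outcome reported in this issue.
# Assumption: the "llama2" label for Experiments 1 and 2 is inferred, not stated.
REPORTED = {
    ("llama2", 3, True): "repeated tokens or empty output",
    ("llama2", 2, True): "fine",
    ("llama2", 3, False): "fine",
    ("opt", 3, True): "fine",
}


def failing_cases(reported: dict) -> list:
    """List the combinations whose reported outcome is anything other than 'fine'."""
    return [key for key, outcome in reported.items() if outcome != "fine"]


if __name__ == "__main__":
    for model, stage, hybrid in failing_cases(REPORTED):
        print(model, ds_config_fragment(stage, hybrid))
```

Laid out this way, the only failing combination is Llama 2 with both ZeRO-3 and the hybrid engine enabled; changing either setting, or switching to OPT, gives normal output.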

I'm confused by this behavior and want to know how to fix it. Due to time and GPU memory constraints, I need to use ZeRO-3 with the hybrid engine enabled.

terence1023 avatar Feb 26 '24 14:02 terence1023

@arashb @ShadenSmith @jeffra @selfReference Could you please look into this problem? Thanks!

terence1023 avatar Feb 27 '24 02:02 terence1023

Hi, @terence1023. I am facing the same issue. Could you please let me know if you have found a solution to this issue? Any guidance or suggestions you could provide would be greatly appreciated.

Thank you for your time and assistance.

luhengtong avatar May 28 '24 06:05 luhengtong

Has this been solved?

ouyanmei avatar Aug 20 '24 10:08 ouyanmei