DeepSpeedExamples [BUG] DeepSpeed-Chat Step3 - actor model repeats generating the same token when hybrid engine enabled

[BUG] DeepSpeed-Chat Step3 - actor model repeats generating the same token when hybrid engine enabled

Open GeekDream-x opened this issue 1 year ago • 9 comments

Keep other settings the same, when enabling the hybrid engine, the actor model in Step 3 generates the same token one by one until reaching the max length of the answer (id 29962 is the end of my prompt, the repeated token id is 517):

Screenshot 2023-11-30 at 17 53 08

when I disabled the hybrid engine, the actor model generates normally:

Screenshot 2023-11-30 at 17 46 02

Is there anything wrong with the hybrid engine? Thanks!

Nov 30 '23 10:11 GeekDream-x

Model: llama-2

Dec 01 '23 15:12 GeekDream-x

It repeats to generate '\n' more frequently. Screenshot 2023-12-07 at 09 38 31

Dec 07 '23 01:12 GeekDream-x

想问您一下，您把 --hybrid_engine_enabled参数去掉以后，训练速度是不是特别慢

Dec 15 '23 03:12 zjintheroom

想问您一下，您把 --hybrid_engine_enabled参数去掉以后，训练速度是不是特别慢

@zjintheroom 是的，耗时增加了一倍

Dec 15 '23 06:12 GeekDream-x

想问您一下，您把 --hybrid_engine_enabled参数去掉以后，训练速度是不是特别慢

@zjintheroom 是的，耗时增加了一倍

谢谢您的回复，想问您一下，您的配置文件方便给一下么，actor model 和 rewarded model 的zero stage，您这边是怎么选的呢

Dec 15 '23 06:12 zjintheroom

想问您一下，您把 --hybrid_engine_enabled参数去掉以后，训练速度是不是特别慢

@zjintheroom 是的，耗时增加了一倍

谢谢您的回复，想问您一下，您的配置文件方便给一下么，actor model 和 rewarded model 的zero stage，您这边是怎么选的呢

都是stage 3，这个相关的设置基本与DS-Chat保持一致

Dec 15 '23 08:12 GeekDream-x

@GeekDream-x 您好，我也遇到了同样的，想问一下您最终是否找到了解决方法，是否可以提供一些解决该问题的思路，谢谢🙏