MiniCPM icon indicating copy to clipboard operation
MiniCPM copied to clipboard

[Bad Case]: DPO FP32 Model - Repeat output

Open soulteary opened this issue 1 year ago • 2 comments

Description / 描述

参考图片

Case Explaination / 案例解释

image

soulteary avatar Feb 02 '24 06:02 soulteary

收到,这个情况我们自己也发现了,应该是精度误差以及huggingface attention的实现导致的,目前解决方案是可以开repetition penalty,稍后我们会更新解决方案。 Understood, we have also noticed this issue. It appears to be caused by precision errors and the implementation of huggingface attention. The current solution is to enable repetition penalty, and we will update with a solution later on.

ShengdingHu avatar Feb 02 '24 08:02 ShengdingHu

我在手机上运行也出现了类似的情况,模型一直不停输出相同内容。手机型号是redmi note12turbo,miui14。 d3ee43ea9aa3697edeb55011583b6990_720

watermeko avatar Feb 08 '24 05:02 watermeko