The flm model and the glm2 model produce inconsistent output

Open newsongwf opened this issue 1 year ago • 3 comments

newsongwf avatar Jul 11 '23 09:07 newsongwf

glm1 is inconsistent as well:

load from pf + merge + lora + cast to int4.

Support for loading int4 directly is needed.

Input : Who is Elon Musk?

Output:
...
m Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium Consortium proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome
 proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasome proteasomeDjangoDjangoDjango
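
One way to narrow down a failure like this is to check the merge step on its own before any int4 cast. The sketch below is not fastllm or PEFT code: it is a pure-Python illustration, with made-up shapes and random values, of the standard LoRA merge `W + (alpha / r) * B @ A`. It confirms that the merged weight gives the same output as running the base weight and the adapter separately. All names (`matmul`, `merge_lora`, `max_diff`) are hypothetical helpers.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def merge_lora(w, lora_a, lora_b, alpha, r):
    """Return W + (alpha / r) * B @ A, the standard LoRA merge."""
    delta = matmul(lora_b, lora_a)
    scale = alpha / r
    return [[wv + scale * dv for wv, dv in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]

random.seed(0)
d_out, d_in, r, alpha = 4, 6, 2, 4  # toy sizes, chosen only for illustration

w = [[random.gauss(0, 0.1) for _ in range(d_in)] for _ in range(d_out)]
lora_a = [[random.gauss(0, 0.1) for _ in range(d_in)] for _ in range(r)]
lora_b = [[random.gauss(0, 0.1) for _ in range(r)] for _ in range(d_out)]
x = [[random.gauss(0, 1.0)] for _ in range(d_in)]  # column vector

# Path 1: a single matmul with the merged weight.
y_merged = matmul(merge_lora(w, lora_a, lora_b, alpha, r), x)

# Path 2: base output plus the scaled adapter output, kept separate.
y_base = matmul(w, x)
y_adapter = matmul(lora_b, matmul(lora_a, x))
y_unmerged = [[b[0] + (alpha / r) * a[0]] for b, a in zip(y_base, y_adapter)]

max_diff = max(abs(m[0] - u[0]) for m, u in zip(y_merged, y_unmerged))
print(f"max difference between merged and unmerged paths: {max_diff:.2e}")
```

If the merged model already matches the adapter model in full precision, the merge is probably not the culprit and the quantization step is the next thing to inspect.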

YerongLi avatar Jul 11 '23 19:07 YerongLi

The built-in prompt template differs from the one glm2 uses. After changing it to match, the results improved, but some differences remain.
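
A quick way to rule out prompt differences is to build the prompt both ways and compare the strings character by character. The sketch below is a minimal, hypothetical helper, not part of fastllm. The "reference" template is modeled on the ChatGLM2 round format as commonly described (`[Round N]`, blank line, `问：`, blank line, `答：`), which is an assumption to verify against your own tokenizer; the "other" template is an invented variant with single newlines.

```python
def build_prompt(query, history, round_fmt, final_fmt):
    """Assemble a multi-round prompt from (question, answer) history pairs."""
    parts = [round_fmt.format(i=i + 1, q=q, a=a) for i, (q, a) in enumerate(history)]
    parts.append(final_fmt.format(i=len(history) + 1, q=query))
    return "".join(parts)

def first_difference(a, b):
    """Return the index of the first differing character, or None if equal."""
    for idx, (ca, cb) in enumerate(zip(a, b)):
        if ca != cb:
            return idx
    return None if len(a) == len(b) else min(len(a), len(b))

# Assumed reference format (double newlines); check it against your tokenizer.
ref_round = "[Round {i}]\n\n问：{q}\n\n答：{a}\n\n"
ref_final = "[Round {i}]\n\n问：{q}\n\n答："

# Hypothetical variant with single newlines, standing in for a mismatched template.
alt_round = "[Round {i}]\n问：{q}\n答：{a}\n"
alt_final = "[Round {i}]\n问：{q}\n答："

query = "Who is Elon Musk?"
ref_prompt = build_prompt(query, [], ref_round, ref_final)
alt_prompt = build_prompt(query, [], alt_round, alt_final)

diff_at = first_difference(ref_prompt, alt_prompt)
print("prompts identical" if diff_at is None else f"prompts diverge at index {diff_at}")
print(repr(ref_prompt))
print(repr(alt_prompt))
```

Printing with `repr` makes whitespace differences visible, which is where template mismatches tend to hide.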

newsongwf avatar Jul 12 '23 03:07 newsongwf

The accuracy of the int4 quantization method has now been improved, and results on the original model are much better. Give it a try.
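
For intuition on why the quantization scheme matters so much at 4 bits, here is a small pure-Python sketch. It does not reproduce fastllm's actual int4 method, which this thread does not describe. It only compares a generic min/max quantizer applied to a whole tensor against the same quantizer applied per group of 64 values, on synthetic weights with two injected outliers. The function names and the group size are assumptions made for illustration.

```python
import random

def quantize_int4(values):
    """Min/max quantization to 16 levels; returns the dequantized floats."""
    lo, hi = min(values), max(values)
    scale = (hi - lo) / 15 or 1.0  # fall back to 1.0 when all values are equal
    codes = [round((v - lo) / scale) for v in values]  # integer codes in 0..15
    return [lo + c * scale for c in codes]

def quantize_grouped(values, group_size):
    """Apply quantize_int4 independently to consecutive groups of values."""
    out = []
    for start in range(0, len(values), group_size):
        out.extend(quantize_int4(values[start:start + group_size]))
    return out

def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(4096)]
weights[10] = 1.5     # synthetic outliers that stretch the global range
weights[2000] = -1.2

per_tensor_err = mse(weights, quantize_int4(weights))
grouped_err = mse(weights, quantize_grouped(weights, 64))

print(f"per-tensor MSE: {per_tensor_err:.3e}")
print(f"grouped MSE:    {grouped_err:.3e}")
```

With a single range for the whole tensor, the two outliers force a coarse step size on every other weight. Per-group ranges confine that damage to the groups that contain an outlier.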

ztxz16 avatar Jul 13 '23 07:07 ztxz16

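@YerongLi, hello. After merging LoRA and converting to an int4 fastllm model, the answers keep repeating. Is there any way to fix this? I am running into the same problem: the original model works fine, but as soon as LoRA is merged it behaves like this.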

ZhouShaoyang avatar Jul 27 '23 07:07 ZhouShaoyang