Yushi Bai comments

Results: 102 comments by Yushi Bai

[rank4]: Input should be a valid integer, got a number with a fractional part [type=int_from_float, input_value=15099494.4, input_type=float]

It looks like the replacement did not take effect on your side. The [modeling_chatglm.py](https://github.com/THUDM/LongWriter/blob/main/train/patch/modeling_chatglm.py) we use for training does not contain this line: `File "/home/hnjj/.cache/huggingface/modules/transformers_modules/glm-4-9b-chat/modeling_chatglm.py", line 416, in __init__ [rank7]: self.core_attention = CORE_ATTENTION_CLASSES[config._attn_implementation](config, self.layer_number)`. That line exists only in the code from the original HF repository.

[rank4]: Input should be a valid integer, got a number with a fractional part [type=int_from_float, input_value=15099494.4, input_type=float]

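We recommend starting from the glm-4-9b (base) model and training on a mixture of general SFT data and the LongWriter-6k data. Training directly from glm-4-9b-chat gives substantially worse results.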

[rank4]: Input should be a valid integer, got a number with a fractional part [type=int_from_float, input_value=15099494.4, input_type=float]

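> > It looks like the replacement did not take effect on your side. The [modeling_chatglm.py](https://github.com/THUDM/LongWriter/blob/main/train/patch/modeling_chatglm.py) we use for training does not contain this line: `File "/home/hnjj/.cache/huggingface/modules/transformers_modules/glm-4-9b-chat/modeling_chatglm.py", line 416, in __init__ [rank7]: self.core_attention = CORE_ATTENTION_CLASSES[config._attn_implementation](config, self.layer_number)`. That line exists only in the code from the original HF repository.
>
> I tried it and that is indeed the case: after replacing the original file, running the train file still uses the original modeling_chatglm.py.

You need to pass the argument `trust_remote_code=True` when loading the model.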
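As a rough illustration of the two points in this exchange, here is a minimal sketch, not the LongWriter training code. It assumes the checkpoint is loaded through the `transformers` `AutoModelForCausalLM.from_pretrained` API, and it uses the expression quoted in the traceback as a marker for the unpatched file. The helper names `is_patched` and `load_model` are hypothetical.

```python
from pathlib import Path

# Expression quoted in the traceback. According to the comment above, it appears
# only in the original HF modeling_chatglm.py, not in the training patch.
ORIGINAL_ONLY_MARKER = "CORE_ATTENTION_CLASSES[config._attn_implementation]"


def is_patched(modeling_file) -> bool:
    """Return True if the file does not contain the original-only expression."""
    text = Path(modeling_file).read_text(encoding="utf-8")
    return ORIGINAL_ONLY_MARKER not in text


def load_model(model_dir):
    """Load a checkpoint with trust_remote_code=True, as the reply suggests."""
    # Imported lazily so the check above works without transformers installed.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(model_dir, trust_remote_code=True)
```

Running `is_patched` on the `modeling_chatglm.py` path shown in the traceback is one way to confirm which version of the file is actually being picked up before starting a training run.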

[rank4]: Input should be a valid integer, got a number with a fractional part [type=int_from_float, input_value=15099494.4, input_type=float]

@sunzhufeng12345 @badarrrr Please check whether the FAQ in our [README](https://github.com/THUDM/LongWriter/blob/main/train/README.md) resolves the problems you ran into. Sorry to have kept you waiting.

Yushi Bai

[rank4]: Input should be a valid integer, got a number with a fractional part [type=int_from_float, input_value=15099494.4, input_type=float]

Is it possible to test model-serving frameworks that expose an OpenAI-compatible API, such as ollama or xinference?

results.py bug

Any implementation of new models like Meta-Llama-3.1-8B or Qwen2.5-7B?

Evaluation mechanism update

Context length. I could not find LongWriter's context length in the documentation. Does it inherit the input context length of glm4-128k?

Has anyone managed to train successfully?