chestnut111 issues

Results: 5 issues by chestnut111

mean of the loss

Because the loss is the sum over the batch, is it necessary to take the mean of it?
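As a way to make the question concrete, here is a minimal sketch comparing the gradient of a summed loss with the gradient of a mean loss. It assumes a hypothetical one-parameter model `y_hat = w * x` with squared error, which is not taken from the issue. Under that assumption the two gradients differ only by a factor of the batch size, so the choice mostly affects the effective learning rate.

```python
def sum_and_mean_grads(w, xs, ys):
    """Gradients of the summed and the mean squared-error loss w.r.t. w.

    Hypothetical toy model (an assumption for illustration): y_hat = w * x.
    """
    # d/dw (w*x - y)^2 = 2 * (w*x - y) * x, computed per example
    per_example = [2.0 * (w * x - y) * x for x, y in zip(xs, ys)]
    grad_sum = sum(per_example)        # gradient of the summed loss
    grad_mean = grad_sum / len(xs)     # gradient of the mean loss
    return grad_sum, grad_mean


xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
grad_sum, grad_mean = sum_and_mean_grads(0.5, xs, ys)

# The summed-loss gradient is exactly batch_size times the mean-loss gradient.
print(grad_sum, grad_mean, grad_sum / grad_mean)
```

With a summed loss, a step with learning rate `lr` moves the parameter as far as a step with `lr * batch_size` would under a mean loss, so changing the batch size also changes the step size unless the learning rate is rescaled.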

Error: CUDA error when release memory!

Error: CUDA error when release memory!

PEFT LoRA Segmentation fault (core dumped)

CHATGLM2 LoRA fine-tuning

    model = AutoModel.from_pretrained(model_path, device_map='cpu', trust_remote_code=True)
    model = PeftModel.from_pretrained(model, ckpt_path)  # use your own PEFT adapter here
    model = model.eval()
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
    from fastllm_pytools import llm
    model = llm.from_hf(model, tokenizer, dtype...

Compilation problem

cmake .. -DUSE_CUDA=ON
-- The CUDA compiler identification is unknown
CMake Error at ..a/share/cmake-3.26/Modules/CMakeDetermineCUDACompiler.cmake:603 (message):
  Compiler output:
Call Stack (most recent call first):
  CMakeLists.txt:39 (enable_language)
-- Configuring incomplete, errors occurred!
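One possible cause of "The CUDA compiler identification is unknown" is that CMake cannot locate a working `nvcc`. The issue excerpt does not confirm that this is the cause here. The sketch below is only a quick diagnostic under that assumption: it reports whether `nvcc` is on `PATH` and whether the `CUDACXX` environment variable is set. CMake consults `CUDACXX` when choosing the CUDA compiler. The helper name `find_cuda_compiler` is hypothetical.

```python
import os
import shutil


def find_cuda_compiler(env=None):
    """Report where a CUDA compiler could be picked up from.

    `env` defaults to the current process environment; each value is None
    when nothing is found. Hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env
    return {
        # Environment variable CMake reads to select the CUDA compiler
        "CUDACXX": env.get("CUDACXX"),
        # Full path to nvcc if it is reachable through PATH, else None
        "nvcc_on_path": shutil.which("nvcc", path=env.get("PATH")),
    }


if __name__ == "__main__":
    for key, value in find_cuda_compiler().items():
        print(f"{key}: {value}")
```

If both values are missing, passing the compiler path explicitly with `-DCMAKE_CUDA_COMPILER=<path to nvcc>` on the `cmake` command line may be worth trying.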

[Bug] 2-GPU internvl2-26b inference fails when inter-GPU communication is over PCIe but succeeds over NVLink. Why is that?

### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
-...