chestnut111
Because the loss is summed over the batch, is it necessary to take the mean?
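To make the question concrete, here is a minimal sketch in plain Python, assuming a toy one-parameter model with squared-error loss (the function name and data are hypothetical and not taken from the project in question). It suggests that a summed loss and a mean loss give gradients that differ only by a factor of the batch size, so the choice mostly affects the effective learning rate.

```python
def grad_sum_and_mean(w, xs, ys):
    """Gradients w.r.t. w of the summed and the mean loss,
    where the per-sample loss is (w * x - y) ** 2."""
    per_sample = [2.0 * (w * x - y) * x for x, y in zip(xs, ys)]
    g_sum = sum(per_sample)            # gradient of the summed loss
    g_mean = g_sum / len(per_sample)   # gradient of the mean loss
    return g_sum, g_mean


# Hypothetical toy batch of four samples with targets y = 2 * x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

g_sum, g_mean = grad_sum_and_mean(0.0, xs, ys)
print(g_sum, g_mean)
```

With plain SGD, summing instead of averaging behaves like multiplying the learning rate by the batch size, which matters mainly when the batch size changes between runs.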
Error: CUDA error when release memory!
CHATGLM2 LORA fine-tuning
model = AutoModel.from_pretrained(model_path, device_map='cpu', trust_remote_code=True)
model = PeftModel.from_pretrained(model, ckpt_path)  # use your own PEFT adapter here
model = model.eval()
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
from fastllm_pytools import llm
model = llm.from_hf(model, tokenizer, dtype...
Compilation problem
cmake .. -DUSE_CUDA=ON
-- The CUDA compiler identification is unknown
CMake Error at ..a/share/cmake-3.26/Modules/CMakeDetermineCUDACompiler.cmake:603 (message):
  Compiler output:
Call Stack (most recent call first):
  CMakeLists.txt:39 (enable_language)
-- Configuring incomplete, errors occurred!
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
-...