MedicalGPT icon indicating copy to clipboard operation
MedicalGPT copied to clipboard

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Results 45 MedicalGPT issues
Sort by recently updated
recently updated
newest added

- [ ] I checked to make sure that this is not a duplicate issue ### Describe the solution you'd like A clear and concise description of what you want...

enhancement

### Describe the Question Please provide a clear and concise description of what the question is. 感谢分享工作! SFT训练还没结束时,我选了其中一个保存的checkpoint,使用merge_peft_adapter.py融合llama2-7B和这个lora model,但是最终融合的文件后缀是这样的model-00001-of-00002.safetensors,很奇怪,我想在这个融合模型基础上继续做SFT就不行了,不知道为何。。。

question

### Describe the Question Please provide a clear and concise description of what the question is. chatglm2是不是做不了PPO相关的训练,我在rm模型中用了bert训练,但是无法合并参数,同时第四部的rl训练也显示ChatGLM2模型没有AutoModelForCausalLMWithValueHead,请问这种情况是不是只能换模型了

question

### Describe the Question Please provide a clear and concise description of what the question is. 大佬可以提供一个关于从预训练到SFT再到RLHF的各个阶段训练即推理的例子吗,把这几个串一下,比如预训练后,推理测试,感觉ok后,再进入SFT阶段,完后再推理测试,以此类推,这样有利于大家一起来讨论这个过程中需要改进的点?多谢了

question

各位大佬,对chatglm3进行预训练运行pretraining.py时报错: RuntimeError: CUDA error: device-side assert triggered Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 想问一下怎么解决啊 每次都是训练到134步时报错,之前训练都是好的,突然就这样了

bug