PaddleNLP
PaddleNLP copied to clipboard
[llm]support long sequence training
PR types
New features
PR changes
Others
Description
新增支持单机8卡 llama 128k训练 待解决问题:
- 如果fuse_fused_head_and_loss_fn,开启pp和开eval的时候loss异常需要排查