LLaMA-Factory

Integrates DeepSpeed-Ulysses sequence parallel, enabling efficient training of large language models with ultra-long sequences.

Open githisw opened this issue 9 months ago • 0 comments

Reminder

  • [x] I have read the above rules and searched the existing issues.

System Info

This project is an extension of LLaMA-Factory that integrates DeepSpeed-Ulysses sequence parallelism, enabling efficient training of large language models on ultra-long sequences. Repository: https://github.com/githisw/LLaMA-Factory. I have tested it and will release a report later.
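For readers new to the technique, here is a rough single-process sketch of the layout change that DeepSpeed-Ulysses is generally described as performing around attention: each rank starts with a slice of the sequence and all heads, and an all-to-all leaves each rank with the full sequence and a slice of the heads. This is not code from the linked fork. It uses plain Python lists in place of tensors and real collectives, and every function name below is hypothetical.

```python
# Single-process simulation of the Ulysses-style all-to-all layout change.
# No GPUs, DeepSpeed, or torch.distributed calls are involved; all names
# here are hypothetical and chosen only for illustration.

def shard_by_sequence(x, world_size):
    """Split x[seq][head] into contiguous sequence chunks, one per rank."""
    chunk = len(x) // world_size
    return [x[r * chunk:(r + 1) * chunk] for r in range(world_size)]

def seq_to_head_all_to_all(seq_shards):
    """Simulate the all-to-all that runs before attention.

    Input:  seq_shards[rank][local_seq][head]  (seq/P tokens, all heads)
    Output: out[rank][seq][local_head]         (all tokens, heads/P heads)
    """
    world_size = len(seq_shards)
    num_heads = len(seq_shards[0][0])
    heads_per_rank = num_heads // world_size
    out = []
    for dst in range(world_size):
        h0 = dst * heads_per_rank
        rows = []
        # Concatenating source ranks in order restores the sequence order.
        for src in range(world_size):
            for token in seq_shards[src]:
                rows.append(token[h0:h0 + heads_per_rank])
        out.append(rows)
    return out

def head_to_seq_all_to_all(head_shards):
    """Simulate the inverse all-to-all that runs after attention."""
    world_size = len(head_shards)
    seq_len = len(head_shards[0])
    chunk = seq_len // world_size
    out = []
    for dst in range(world_size):
        rows = []
        for s in range(dst * chunk, (dst + 1) * chunk):
            token = []
            # Gather this token's head slices back from every rank.
            for src in range(world_size):
                token.extend(head_shards[src][s])
            rows.append(token)
        out.append(rows)
    return out

# Toy "activations": each entry is a (token_index, head_index) label.
seq_len, num_heads, world_size = 8, 4, 2
x = [[(s, h) for h in range(num_heads)] for s in range(seq_len)]

seq_shards = shard_by_sequence(x, world_size)       # 4 tokens x 4 heads per rank
head_shards = seq_to_head_all_to_all(seq_shards)    # 8 tokens x 2 heads per rank
restored = head_to_seq_all_to_all(head_shards)      # back to 4 tokens x 4 heads
```

The sketch assumes the sequence length and the head count are both divisible by the number of ranks. In a real run each rank would compute attention over its own heads between the two all-to-all steps.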

Reproduction

Put your message here.

Others

No response

githisw, Mar 17 '25 06:03