ART
ART copied to clipboard
Torchtune development has been discontinued. Have you considered switching to a different multi-GPU train backend?
ref: https://github.com/meta-pytorch/torchtune/issues/2883
Candidates
- torchforge
- verl
- TRL
- prime-rl
the Candidate Post-training frameworks:
- veOmni
- automodel
- llama-factory
- ms-swift
RL :
- veRL
Yes, we are currently experimenting with Megatron to support bigger models. We might also consider other options.