verl WIP: FP8 train

Open horsebridge opened this issue 6 months ago • 3 comments

A draft for FP8 training. It currently depends on fsdp2 and torchao to train with per-tensor FP8 quantization. To enable it, install torchao and set strategy=fsdp2 and fsdp_config.fp8=True. Note: convergence is not guaranteed yet.
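
For readers unfamiliar with the term, per-tensor FP8 quantization means that one scale factor is chosen for the whole tensor so that its largest magnitude maps onto the largest representable FP8 value. The sketch below illustrates the idea in plain Python. It is not the code from this draft or from torchao: the e4m3 format (maximum finite value 448, 3 mantissa bits) is an assumption, and the helper names `round_to_e4m3`, `quantize_per_tensor`, and `dequantize` are hypothetical.

```python
import math

FP8_E4M3_MAX = 448.0  # largest finite value in the e4m3fn format (assumed format)


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest e4m3fn-representable value, saturating at +/-448."""
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), FP8_E4M3_MAX)
    _, exp = math.frexp(mag)   # mag = m * 2**exp with 0.5 <= m < 1
    e = max(exp - 1, -6)       # -6 is the smallest normal exponent; below it, subnormals
    step = 2.0 ** (e - 3)      # 3 mantissa bits give 8 steps per power of two
    # Python's round() breaks ties to even, like the usual float rounding mode.
    return sign * min(round(mag / step) * step, FP8_E4M3_MAX)


def quantize_per_tensor(values):
    """Quantize a flat list of floats with a single scale for the whole tensor."""
    amax = max(abs(v) for v in values)
    scale = FP8_E4M3_MAX / amax if amax > 0 else 1.0
    return [round_to_e4m3(v * scale) for v in values], scale


def dequantize(quantized, scale):
    """Map quantized values back to the original range."""
    return [q / scale for q in quantized]


if __name__ == "__main__":
    tensor = [0.5, -1.0, 0.001, 0.25]
    q, scale = quantize_per_tensor(tensor)
    print("scale:", scale)
    print("quantized:", q)
    print("restored:", dequantize(q, scale))
```

Because a single scale covers the whole tensor, small entries lose relative precision when one large entry dominates the maximum, which is one plausible reason convergence may be affected.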

May 12 '25 14:05 horsebridge

All committers have signed the CLA.

May 12 '25 14:05 CLAassistant

@horsebridge Hello, have you measured how FP8 training affects throughput? In my experiments, setting fsdp_config.fp8=True makes training much slower.

May 26 '25 09:05 Yzx835

@Yzx835 Yes, I also observed the slower speed. I think it is an issue in torchao.

May 28 '25 11:05 horsebridge
