WIP: FP8 train

Open horsebridge opened this issue 6 months ago • 3 comments

A draft of FP8 training support. It currently relies on fsdp2 and torchao to train with per-tensor FP8 quantization. To enable it, install torchao and set strategy=fsdp2 and fsdp_config.fp8=True. Note: convergence is not guaranteed yet.
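
For readers unfamiliar with the term, per-tensor FP8 quantization can be sketched roughly as follows. This is a pure-Python illustration, not the code in this PR and not torchao's implementation: it assumes the e4m3fn format (3 mantissa bits, largest finite value 448) and a single scale derived from the tensor's absolute maximum. The helper names round_to_e4m3, quantize_per_tensor and dequantize are hypothetical.

```python
import math

E4M3_MAX = 448.0  # largest finite value of the float8 e4m3fn format


def round_to_e4m3(v: float) -> float:
    """Round v to the nearest e4m3fn-representable value, saturating at +/-448."""
    if v == 0.0:
        return 0.0
    sign = -1.0 if v < 0 else 1.0
    a = min(abs(v), E4M3_MAX)
    _, e = math.frexp(a)        # a == m * 2**e with 0.5 <= m < 1
    exp = max(e - 1, -6)        # -6 is the smallest normal exponent; below it values are subnormal
    step = 2.0 ** (exp - 3)     # spacing between neighbours with 3 mantissa bits
    return sign * min(round(a / step) * step, E4M3_MAX)


def quantize_per_tensor(values):
    """Quantize a flat list of floats with ONE scale shared by the whole tensor."""
    amax = max(abs(v) for v in values)
    # Map the largest magnitude onto the largest representable FP8 value.
    scale = E4M3_MAX / amax if amax > 0 else 1.0
    return [round_to_e4m3(v * scale) for v in values], scale


def dequantize(quantized, scale):
    """Undo the scaling; the rounding error from quantization remains."""
    return [q / scale for q in quantized]


if __name__ == "__main__":
    weights = [0.5, -2.0, 0.01, 1.25]
    q, scale = quantize_per_tensor(weights)
    print("scale:", scale)
    print("quantized:", q)
    print("round trip:", dequantize(q, scale))
```

Because one scale covers the whole tensor, a single outlier sets the range for every element, which is one reason FP8 training may need care to converge.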

horsebridge avatar May 12 '25 14:05 horsebridge

CLA assistant check
All committers have signed the CLA.

CLAassistant avatar May 12 '25 14:05 CLAassistant

@horsebridge Hello, have you measured how FP8 training affects throughput? In my experiments, setting fsdp_config.fp8=True makes training much slower.

Yzx835 avatar May 26 '25 09:05 Yzx835

@Yzx835 Yes, I also observed the slowdown. I think it is an issue in torchao.

horsebridge avatar May 28 '25 11:05 horsebridge