RL4LMs
RL4LMs copied to clipboard
Mix-Precision training
Hey, Are there any plans to add support for mixed precision training? I did see in #12 a temporary solution was suggested, but it still throws multiple exceptions relating to mathematical operations between fp16 and fp32 values. Thanks! @rajcscw