Kaiyinzhou

Results: 1 issue by Kaiyinzhou

When I train with KTO, the KL value quickly drops to 0. Is this normal? ``` {'loss': 0.4173, 'grad_norm': 1.4672807732482507, 'learning_rate': 4.765488274413721e-06, 'rewards/chosen': 1.194046974182129, 'logps/chosen': -18.560531616210938, 'rewards/rejected': 0.43546485900878906, 'logps/rejected':...