direct-preference-optimization icon indicating copy to clipboard operation
direct-preference-optimization copied to clipboard

Qwen model issues & embedding and loss has nan

Open lylcst opened this issue 1 year ago • 5 comments

after a loss backward and optimizer step, then forward the embedding layer output hidden states become inf and loss is nan.

lylcst avatar Nov 03 '23 12:11 lylcst