alignment-handbook Why does the alignment-handbook account for user & system Inputs in loss calculation

Why does the alignment-handbook account for user & system Inputs in loss calculation

Open xffxff opened this issue 1 year ago • 3 comments

I noticed that the alignment-handbook doesn't ignore the loss calculated from both the user and system inputs Based on my knowledge, many SFT choose to ignore these. I'm curious about the reasoning behind this difference.

Nov 28 '23 06:11 xffxff

alignment-handbook alignment-handbook copied to clipboard

Why does the alignment-handbook account for user & system Inputs in loss calculation

alignment-handbook
alignment-handbook copied to clipboard