alignment-handbook
alignment-handbook copied to clipboard
Why does the alignment-handbook account for user & system Inputs in loss calculation
I noticed that the alignment-handbook doesn't ignore the loss calculated from both the user and system inputs Based on my knowledge, many SFT choose to ignore these. I'm curious about the reasoning behind this difference.