alignment-handbook
alignment-handbook copied to clipboard
Question about the evaluation dataset
Hi, I wonder which TruthfulQA task you are focusing on during evaluation? MC1, MC2, or generation task?
asking about this as well.