shallow-vs-deep-alignment icon indicating copy to clipboard operation
shallow-vs-deep-alignment copied to clipboard

KL Divergence Computation

Open guoyang9 opened this issue 3 months ago • 0 comments

Hi Authors,

Thank you for your great piece of work! Can I check with you how you computed the KL divergence between aligned and unaligned models? For example, the aligned Llama2-7B-Chat model requires a chat template while the unaligned model does not. Did you consider the special tokens that are required by the aligned chat model?

guoyang9 avatar Sep 22 '25 07:09 guoyang9