How to implement weight decay towards the pre-trained model?

Open sedol1339 opened this issue 1 year ago • 0 comments

Hello, let me ask one question.

When using FastChat for supervised fine-tuning, how can I penalize the distance between the starting (pre-trained) weights and the current weights? This was shown to be effective in https://arxiv.org/abs/1706.03610
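
To make the question concrete, here is a minimal sketch of the penalty I have in mind, written in plain PyTorch rather than against any FastChat API. The helper names `snapshot_params` and `distance_penalty`, the `strength` parameter, and the toy `nn.Linear` model are all hypothetical and only for illustration. How best to hook this into FastChat's training loop is exactly the part I am unsure about.

```python
import torch
from torch import nn


def snapshot_params(model: nn.Module) -> dict:
    """Copy the trainable starting weights so they stay fixed during training."""
    return {
        name: p.detach().clone()
        for name, p in model.named_parameters()
        if p.requires_grad
    }


def distance_penalty(model: nn.Module, start: dict, strength: float) -> torch.Tensor:
    """Return strength/2 * sum of squared differences from the starting weights."""
    total = sum(
        (p - start[name]).pow(2).sum()
        for name, p in model.named_parameters()
        if name in start
    )
    return 0.5 * strength * total


# Toy stand-in for the pre-trained model (hypothetical; not a FastChat model).
torch.manual_seed(0)
model = nn.Linear(4, 2)
start = snapshot_params(model)  # taken once, before any optimizer step
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

x, y = torch.randn(8, 4), torch.randn(8, 2)
for _ in range(5):
    optimizer.zero_grad()
    task_loss = nn.functional.mse_loss(model(x), y)
    # Add the pull towards the starting weights to the ordinary task loss.
    loss = task_loss + distance_penalty(model, start, strength=0.01)
    loss.backward()
    optimizer.step()
```

Note that the snapshot keeps a second copy of every trainable parameter, so for a large model it would probably need to cover only the parameters being updated. The optimizer's built-in `weight_decay` is not a substitute here, since it pulls weights towards zero rather than towards the starting point.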

sedol1339, Oct 03 '24 11:10