Yujian Liu
Results
13
comments of
Yujian Liu
Thanks, this helps a lot!
I'm also interested in the support of multiple datasets. A use case I can think of is during instruction tuning, we would like to also add pre-training loss for regularization.
As a follow up question, is there any methods that can give a relaxed khot vector within [0, 1]? I want to represent the unselected elements as 1 - khot...