liujinyuan
Results
3
issues of
liujinyuan
我没有理解few shot中的KPT-LR后是怎样使用few shot训练数据的?我的理解是仅在LR时使用了few shot提供的数据,而不明白-LR后是怎样运作的。
In your paper the labeling method seems to be labeled by star ratings, so how is this converted to specific floating point scores when training RM?
about score
in section 3.3. Training with Data from a Language Model,how to get keywords from coco caption dataset