liujinyuan

Results 3 issues of liujinyuan

我没有理解few shot中的KPT-LR后是怎样使用few shot训练数据的?我的理解是仅在LR时使用了few shot提供的数据,而不明白-LR后是怎样运作的。

In your paper the labeling method seems to be labeled by star ratings, so how is this converted to specific floating point scores when training RM?

about score

in section 3.3. Training with Data from a Language Model,how to get keywords from coco caption dataset