Xianchao Wu
https://github.com/OpenBMB/InfiniteBench/blob/main/src/compute_scores.py#L238
1. Only one reference label is used for comparison; it would be better to loop over every answer in the label, e.g., label=['ECKER', 'COMMANDER BILL ECKER'].
2. The prediction phrase is split into words for...
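To illustrate the first point, here is a minimal sketch of comparing a prediction against every reference answer instead of only one. It is not the code from compute_scores.py: the helper names `normalize` and `matches_any_label`, and the case-insensitive exact-match rule, are assumptions made purely for illustration.

```python
from typing import Sequence


def normalize(text: str) -> str:
    """Uppercase and collapse whitespace (an assumed normalization rule)."""
    return " ".join(text.upper().split())


def matches_any_label(prediction: str, labels: Sequence[str]) -> bool:
    """Return True if the prediction equals any one of the reference labels.

    Hypothetical helper: loops over all labels rather than checking a
    single reference answer.
    """
    pred = normalize(prediction)
    return any(pred == normalize(label) for label in labels)


if __name__ == "__main__":
    label = ["ECKER", "COMMANDER BILL ECKER"]
    # The longer reference answer is accepted as well as the shorter one.
    print(matches_any_label("Commander Bill Ecker", label))
    print(matches_any_label("Ecker", label))
    print(matches_any_label("Smith", label))
```

With a loop like this, a prediction is not penalized for matching the second reference answer rather than the first. The actual matching rule used by the benchmark may differ, so the comparison inside `any(...)` would need to be replaced with whatever the scoring function really uses.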