Kaiyang Guo

Results 2 issues of Kaiyang Guo

Thanks for sharing the amazing repo! The GPT-4 win rate prompt stated in the paper is attached below. As HH dataset concerns both helpful and harmless, I wonder why only...

Thanks for sharing this awesome repo! The paper reports results on MMLU, GSM8K, HumanEval and BigBench-Hard. It seems this repo does not contain the codes for evaluating on these benchmark...