PENG,Yiyang

Results 1 issues of PENG,Yiyang

Hi ! I found you have got the hallucination score of QWEN3 MODEL with parameter --enable_thinking=False. I wonder if there is a big hallucination score gap between thinking mode and...