Peixuan Xia
I ran into the same problem. When I used the LLama2_7b_chat_hf model to evaluate my RAG results, some metrics, such as `answer_correctness`, were always `np.nan`. I guess the code cannot...