DAMO-ConvAI icon indicating copy to clipboard operation
DAMO-ConvAI copied to clipboard

Inference script for the level-3 data

Open polgrisha opened this issue 2 years ago • 1 comments

Hello,

I am trying to run evaluation of the Lynx model on level-3 data. However, I did not find the inference script and unsure of how to reproduce it.

My question is: Did the model generate all steps of tool calling during the lvl-3 evaluation and received feedback from tool after each step? How the errors were handled? How the errors were parsed and added into the tool output? How the errors were incorporated in the tool output if the model hallucinated and generated something that couldn't be parsed? Is it possible to provide the full script of running the Lynx model on lvl-1, lvl-2 and lvl-3 data?

Thanks in advance Gregory

polgrisha avatar Nov 28 '23 09:11 polgrisha

Hello, have you successfully run the level 3 evaluation? Could you please share the code?

Watebear avatar Jul 29 '24 07:07 Watebear