Usama
Closing the PR due to inactivity; please reopen if you get a chance to address comments.
Closing the PR due to inactivity; please reopen if you get a chance to address comments.
Closing the PR due to inactivity; please feel free to reopen if you get a chance to address the comments.
Closing the PR due to inactivity; please reopen if you get a chance to address comments.
Sorry for the confusion. I meant the description in the `.yaml` file, because once this PR is merged, the only way to get any information about this eval will be...
To resolve the workflow-related issues, please merge the latest `master` branch into your branch.
Thanks for opening this PR. Character-level reasoning and operations are a well-known failure mode of the model due to a common underlying issue in LLMs. In its current form, this...
Thank you for opening this PR. We're not accepting evals that have custom code implementations at this moment (but we are accepting custom model-graded evals). If possible, could you rewrite...
Closing the PR due to inactivity; please reopen if you get a chance to address comments.
Thanks for opening this PR. Producing the output of such a complex piece of code is hard for the model to do zero-shot, without a chance to reason...