evals issues

Offtopic: Downloading chatgpt history

6

Hi, I'm sorry to ask this here, but I don't know where else to go. I have hundreds of prompts in my ChatGPT history, a few of which, I think, will...

irthomasthomas

Add slope-intercept eval

4

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

MrDevel0per

Mistakes

Satisfied with the use, but there is a problem that occurs constantly. When solving problems in Python, it outputs the correct code, but the result of this code does not...

Maximkrupchatnikov

add predict look-again-and-say sequence eval test

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

gimseng

Large Multiplication High Precision Eval

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

mmtmn

New eval for math contests (AMC 10/12)

2

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

emptycrown

Chess best move

1

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

mybarman

Chess: Counting pieces left on the board

1

## Eval details 📑 ### Eval name Chess Piece Count ### Eval description Tests the model's ability to understand and play out chess moves by reading input in a PGN...

jatinparab98

Chess draw by insufficient material

1

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

TonyLLondon

Chess legal setup

1

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

TonyLLondon
