
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Results: 428 evals issues, sorted by recently updated

oaieval.py still ran against the very first version of the dataset after I updated the jsonl file, as the program only executes one item while I have three in my...
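
For context, here is a minimal sketch of how a JSONL dataset like the one described above can be loaded so that every sample is processed rather than only the first. The file name and the "input" field are illustrative placeholders, not taken from the issue or from the evals codebase:

```python
import json
from pathlib import Path


def load_samples(path: str) -> list:
    """Read a JSONL dataset: one JSON object per line, skipping blank lines."""
    samples = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if line:
            samples.append(json.loads(line))
    return samples


# Every sample should be evaluated, so iterate over the whole list
# rather than stopping after the first entry.
for sample in load_samples("samples.jsonl"):  # placeholder path
    print(sample["input"])  # "input" is an illustrative field name
```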

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines; failure to follow the guidelines below will result in the PR being closed...

Update PULL_REQUEST_TEMPLATE.md and add eval categories
# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines; failure to follow the guidelines below will...

Changes:
- Refactor the existing caching mechanism in evals/utils.py to utilize a more efficient and flexible data structure, such as an LRU cache, to store prompt and evaluation results.
- ...
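
As a rough illustration of the kind of change proposed, a small LRU cache keyed by prompt could look like the sketch below. This is an independent sketch, not the actual evals/utils.py code; the class and method names are invented for the example:

```python
from collections import OrderedDict


class LRUCache:
    """Keep the most recently used prompt -> result pairs, evicting the
    oldest entry once the cache grows past max_size."""

    def __init__(self, max_size: int = 1024):
        self.max_size = max_size
        self._store = OrderedDict()

    def get(self, prompt: str):
        if prompt not in self._store:
            return None
        self._store.move_to_end(prompt)  # mark as most recently used
        return self._store[prompt]

    def put(self, prompt: str, result) -> None:
        self._store[prompt] = result
        self._store.move_to_end(prompt)
        if len(self._store) > self.max_size:
            self._store.popitem(last=False)  # evict least recently used
```

For pure functions, functools.lru_cache provides similar eviction behavior out of the box; an explicit structure like this is mainly useful when cached results need to be inspected or invalidated manually.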
