evals issues

Results 428 evals issues

Sort by recently updated

Eval: Nostr Bot Detection

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

AtlantisPleb

Email probability eval

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

mmtmn

Other Support

No offense intended, suggested or implied to Python developers. However, a well known issue with Python, is that once developers learn Python, they tend to not want to learn anything...

chatbots

Inferring dates from relative date descriptions [0.61 accuracy on gpt-3.5-turbo]

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

mgibson707

Swap neighboring characters in a sentence

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

BenEaston

:sparkles: Added Eval For Sam Altman Degree (he has an honorary degree from University of Waterloo)

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

OrenLeung

Dataset hosting, data cards and previews

Hi, I'm Quentin from Hugging Face :) I know hosting datasets on github is not always practical: git lfs required, no data preview, limited storage (maybe not for you haha),...

lhoestq

Temperature Conversion from Celsius to Fahrenheit (Fails on very high numbers)

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

anishjain123

Add translated movie names eval

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

PatrickPijnappel

CRM Financial Services Expert (26% accuracy GPT-3.5-Turbo)

# Thank you for contributing an eval! ♥️ 🚨 Please make sure your PR follows these guidelines, __failure to follow the guidelines below will result in the PR being closed...

zestor

evals
evals copied to clipboard

Metadata

Eval: Nostr Bot Detection

Email probability eval

Other Support

Inferring dates from relative date descriptions [0.61 accuracy on gpt-3.5-turbo]

Swap neighboring characters in a sentence

:sparkles: Added Eval For Sam Altman Degree (he has an honorary degree from University of Waterloo)

Dataset hosting, data cards and previews

Temperature Conversion from Celsius to Fahrenheit (Fails on very high numbers)

Add translated movie names eval

CRM Financial Services Expert (26% accuracy GPT-3.5-Turbo)

← Metadata

Owner

Metadata

evals evals copied to clipboard

Metadata

← Metadata

Owner

Metadata

evals
evals copied to clipboard