evaluate issues

`list_evaluation_modules` returns empty list

3

For some reason when i run this: ``` print(evaluate.list_evaluation_modules(include_community=False)) ``` I get an empty list. ``` evaluate==0.4.2 ```

MohamedAliRashad

Benchmark evaluation for language models.

1

Not sure if this feature belongs to this library or would it require a complete separate library. I am proposing the creation of a library where llm benchmarks can be...

mina58

Fix: message for lack of sklearn

When relying on `sklearn`, the help message that should be printed is `pip install scikit-learn`, not `pip install sklearn`.

ArkiZh

add vqa_accuracy and CIDEr

1

Kamichanw

AttributeError: module 'evaluate' has no attribute 'load'

8

![image](https://github.com/user-attachments/assets/c70eb800-1741-4407-b646-63703f4c50b0) i am running evaluate==0.4.2 on python 3.11 and i get this error, please help.

Adesoji1

Unable to compute f1 score - Throwing Value Error trying to convert a string in non english Language to integer

1

I am trying to compte f1 score ,My predictions are mostly numbers whreas my references are sometimes malayalm sting version of these numbers. Thus I am getting a value error...

alans3321

fix import error message for sklearn

for sklearn, if needing install it, the ImportError should write "pip install scikit-learn" instead of "pip install sklearn" now: File "/home/ubuntu/wubing/evaluate/src/evaluate/loading.py", line 265, in _download_additional_modules raise ImportError( ImportError: To be...

bingwork

[Metrics] ValueError: Expected to find locked file from process x but it doesn't exist.

1

When calling .compute in distributed multi-node setting, I get this error - ``` [rank1]: File "/ext3/miniconda3/envs/venv/lib/python3.8/site-packages/transformers/trainer.py", line 2750, in _evaluate [rank1]: metrics = self.evaluate(ignore_keys=ignore_keys_for_eval) [rank1]: File "/ext3/miniconda3/envs/venv/lib/python3.8/site-packages/transformers/trainer_seq2seq.py", line 180, in...

raghavm1

Add `zero_division` to F1 metric

The underlying sklearn function exposes the `zero_division` attribute [1], which is not a keyword argument in the metric wrapper in this library. This means, users get a warning when there...

TimRepke

Update MAUVE's readme and citations

Hi folks, Thank you for maintaining this excellent package! I've just updated mauve's readme based on the following: - the pip package outputs two new fields: `mauve_star` and `frontier_integral_star` -...

krishnap25

evaluate
evaluate copied to clipboard

Metadata

`list_evaluation_modules` returns empty list

Benchmark evaluation for language models.

Fix: message for lack of sklearn

add vqa_accuracy and CIDEr

AttributeError: module 'evaluate' has no attribute 'load'

Unable to compute f1 score - Throwing Value Error trying to convert a string in non english Language to integer

fix import error message for sklearn

[Metrics] ValueError: Expected to find locked file from process x but it doesn't exist.

Add `zero_division` to F1 metric

Update MAUVE's readme and citations

← Metadata

Owner

Metadata

evaluate evaluate copied to clipboard

Metadata

← Metadata

Owner

Metadata

evaluate
evaluate copied to clipboard