Yoav Katz comments

Results 74 comments of


                                            Yoav Katz

matthews_correlation returning 0 on perfect correlation

Yes. You are right - as this is correlation [1,0] and [0,1] are indeed anti-correlated (-1). You can see what they did in f1 (and what they plan to do...

Remaining issues with additional datasets

I think MultipleChoiceTemplate is an additional possible template. I think it's only relevant for multi class and not mult label.

Increase n_resamples for GlobalMetric in testing so confidence intervals are not NaN

@matanor - Please advise. I saw these warnings too.

Add relation extraction

Hi. I added my comments. I think you should create a card that uses the tasks, and loads the raw data from the file, and converts it to the format...

Add relation extraction

> Since this is an important NLP task i suggest we try to get it merged asap: > > My suggestion is to follow the conventions and naming in the...

You need to add evaluate_ensemble_judge.py excluded_files = [ "use_llm_as_judge_metric.py", "standalone_evaluation_llm_as_judge.py", "evaluate_summarization_dataset_llm_as_judge.py", "evaluate_different_formats.py", "evaluate_different_templates.py", "evaluate_different_demo_selections.py", "evaluate_a_judge_model_capabilities_on_arena_hard.py", "evaluate_a_model_using_arena_hard.py", "evaluate_llm_as_judge.py", "evaluate_using_metrics_ensemble.py", "evaluate_existing_dataset_by_llm_as_judge.py", ] in unitxt/tests/library/test_examples.py. Without it, the regression tries to run your...

Yoav Katz

matthews_correlation returning 0 on perfect correlation

Remaining issues with additional datasets

Increase n_resamples for GlobalMetric in testing so confidence intervals are not NaN

Add relation extraction

Add relation extraction

Llm as judge ensemble

Update CONTRIBUTING.md - update path name for activate

Add logprobs functionality

Slow performance due copying of instances

Is metadata well exported