unitxt issues

Performance Suggestion: Replace dict1.keys() & dict2.keys() with set(dict1).intersection(dict2) for Improved Speed and Memory Efficiency

https://github.com/IBM/unitxt/blob/231fd293f53baac6be94133c606d7ddaf66eacd4/src/unitxt/artifact.py#L50 I’d like to propose a minor optimization related to key intersection operations between two dictionaries. Original Code： `keys_in_both = dict1.keys() & dict2.keys()` Suggested Replacement： `keys_in_both = set(dict1).intersection(dict2)` Although both...

SaFE-APIOpt

enhancement

Negative values when evaluating context relevance metrics

4

We encountered an issue where we are getting negative values when evaluating `metrics.rag.context_relevance.sentence_bert_mini_lm`. The expectation here is to get a value between a positive value between 0 and 1. This...

algadhib

Understanding how the metrics are calculated

1

Hi, I am looking athe Multiple-choice QA example provided [here](https://github.com/IBM/unitxt/blob/main/examples/multiple_choice_qa_evaluation.py) . I wanted to know how the accuracy is calculated. Do we 1) ask the model to generate the answer,...

murthyrudra

question

The current instance_score.summary is not readable

1

print("Instance Results:") print(results.instance_scores.summary) ``` text_a text_a_type \ 0 The drain is clogged with hair. It has to be cleaned. premise 1 Jane knocked on Susan's door but she did not...

yoavkatz

stale

Remove deprecated models from metrics/inference engines

1

elronbandel

stale

Supported multiple aliases for environment variables.

1

In my view, there is a need for a common utility in unitxt, that allows specifying multiple environment variable names and ensuring either only one is set, or all the...

yoavkatz

stale

'This event loop is already running' error when using LiteLLM inside another event loop

1

LiteLLMInferenceEngine, as it is implemented on unitxt, creates an event loop. FastAPI/Jupyter Notebooks also create one. `asyncio` doesn't allow nested loop, so we need to rely on `nest_asyncio` to handle...

martinscooper