unitxt issues

Add type checking for prediction_type of complex tasks like RAG

In complex tasks, like RAG, the prediction_type is a dictionary with fixed fields. Today, it is modeled as prediction_type: "Dict". We want to be able to write: "prediction_type": "Dict[{"answer":...

yoavkatz
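
To illustrate the kind of check the `prediction_type` issue above asks for, here is a minimal sketch in plain Python. It does not use unitxt's API: the helper `check_fields`, the spec format, and the `contexts` field are assumptions made up for this example (only `answer` appears in the issue text).

```python
from typing import Any, Dict


def check_fields(prediction: Any, spec: Dict[str, type]) -> bool:
    """Return True if prediction is a dict with exactly the fields in spec,
    each holding a value of the expected type."""
    if not isinstance(prediction, dict):
        return False
    if set(prediction) != set(spec):
        return False
    return all(isinstance(prediction[name], expected) for name, expected in spec.items())


# Hypothetical field spec for a RAG-style prediction.
RAG_PREDICTION_SPEC = {"answer": str, "contexts": list}

good = {"answer": "Paris", "contexts": ["Paris is the capital of France."]}
bad = {"answer": 42, "contexts": []}

print(check_fields(good, RAG_PREDICTION_SPEC))  # True
print(check_fields(bad, RAG_PREDICTION_SPEC))   # False
```

A plain "Dict" type would accept both `good` and `bad`; a spec with fixed fields rejects the second one.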

Replace confusing names of inputs and outputs of Task

Today tasks have "inputs" and "outputs". This confuses many users, who expect output fields to relate to the model output. Actually it should be named: "input_fields" - fields...

yoavkatz

Provide access to raw predictions and references

Unitxt has two concepts: (Raw/Textual) Predictions and Processed Predictions (after applying post-processing operators). Today, the raw predictions which are added to the dataset during evaluation are transformed into the...

yoavkatz

New HF datasets version causes unit test failures when using filtering_lambda

Localized and temporarily disabled a test in test_loaders: @unittest.skip("Currently this fails from datasets 2.20") def test_load_from_HF_multiple_innvocation_with_filter(self): loader = LoadHF( path="CohereForAI/aya_evaluation_suite", name="aya_human_annotated", filtering_lambda='lambda instance: instance["language"]=="eng"', ) ms = loader.process() dataset = ms.to_dataset()...

yoavkatz

bug
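
The `filtering_lambda` argument in the snippet above is a string holding a Python lambda. As a rough sketch of what such a filter does, assuming the string is evaluated into a callable and applied to each instance, consider the following. The helper `filter_instances` and the sample data are hypothetical and are not part of unitxt or datasets.

```python
from typing import Any, Dict, Iterable, List


def filter_instances(
    instances: Iterable[Dict[str, Any]], filtering_lambda: str
) -> List[Dict[str, Any]]:
    """Keep only the instances for which the lambda (given as source text) returns True."""
    # eval is used only because the filter arrives as a string;
    # never do this with untrusted input.
    predicate = eval(filtering_lambda)
    return [instance for instance in instances if predicate(instance)]


# Made-up instances standing in for rows of a loaded dataset.
instances = [
    {"language": "eng", "inputs": "Hello"},
    {"language": "fra", "inputs": "Bonjour"},
]
kept = filter_instances(instances, 'lambda instance: instance["language"]=="eng"')
print(kept)  # [{'language': 'eng', 'inputs': 'Hello'}]
```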

Clean up splitters in catalog

These splitters in unitxt/prepare/splitters/missing_split.py do not have informative names, so when people read them in a file, it's not clear what's happening. I think it's just better to...

yoavkatz

ease-of-use

Improve operator documentation

1

https://unitxt.readthedocs.io/en/latest/docs/operators.html Some issues with this page: 1. Not all operators appear on the page (e.g. Get), probably because they come from other files. 2. It's very hard to look for...

yoavkatz

documentation

ease-of-use

Artifact.from_dict recursively interprets fields that it shouldn't

2

`Artifact.from_dict` will try to convert any dictionary to an artifact, even when the dictionary is a mapping of columns from datasets. **There cannot be a column named `type`:** ```python import tempfile...

jezekra1
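
The failure mode described above can be illustrated without unitxt. The sketch below is a hypothetical recursive loader, not the actual `Artifact.from_dict` implementation and not the reporter's reproduction: it assumes that any nested dictionary containing a `type` key is treated as an artifact description, which is why a column mapping with a `type` key gets misread. The names `naive_from_dict`, `KNOWN_TYPES`, and the operator dictionary are made up for the example.

```python
from typing import Any

# Hypothetical registry of known artifact types, for illustration only.
KNOWN_TYPES = {"rename_fields"}


def naive_from_dict(obj: Any) -> Any:
    """Recursively convert dicts; any dict with a 'type' key is assumed to be an artifact."""
    if isinstance(obj, dict):
        converted = {key: naive_from_dict(value) for key, value in obj.items()}
        if "type" in converted:
            if converted["type"] not in KNOWN_TYPES:
                raise ValueError(f"Unknown artifact type: {converted['type']!r}")
            return ("artifact", converted)
        return converted
    return obj


# A field mapping that happens to rename a dataset column called "type".
operator = {"type": "rename_fields", "field_to_field": {"type": "label"}}

try:
    naive_from_dict(operator)
    error = None
except ValueError as exc:
    error = str(exc)

print(error)  # Unknown artifact type: 'label'
```

The inner mapping is plain data, but the recursion cannot tell it apart from an artifact description, so the column name `type` is rejected.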

Move templates to task

1

Add to the recipe an option to use the templates list from the task and use the first one by default. This simplifies the construction of new cards assuming they...

elronbandel

Add search filtering and data mixing based on tags

elronbandel

Add descriptions and tags to tasks

Descriptions for tasks can allow easier synthetic template generation, as well as seamless integration with libraries like InstructLab and DSPy.

elronbandel
