unitxt issues

Suggestion: Add notion of SubTask

1

Today we have tasks such as `tasks.classification.mutli_class` that are very general by using things like: `class_type` then we can have sentiment and emotion classification under the same task by using...

elronbandel

Add FunctionRecipe that utilze the standard recipe to create a function that can be used to parse data at inference.

elronbandel

Add to all type checking error messages the wrong type it got.

elronbandel

Add DynamicFormat that uses different SystemFormat based on size of demos list and existence of instruction.

```python class DynamicFormat(Format): few_shot_format: Format few_shot_with_instruction_format: Format zero_shot_format: Format zero_shot_with_instruction_format: Format def process(instance): has_instruction = "instruction" not in instance or len(instance["instruction"] == 0 has_demos = len(instance["demos"]) == 0 if has_demos:...

elronbandel

new format and system prompt for MtBench

OfirArviv

Handling sensitive data sent to remote services

1

With the introductions of metrics that can send data to remote services - one needs a safe way to avoid accidentally sending propriety/confidential data to external services. In the common...

yoavkatz

Gather all string operators in one standard file

This PR addresses the scattered nature of string/text operators across different modules—operators and processors—making them challenging to locate, reuse, and maintain a consistent standard while tracking changes. The goal is...

elronbandel

possible full import needed in unitxt.dataset

1

@matanor unitxt/src/unitxt/dataset.py currently has from .from .dataset_utils import get_dataset_artifact I just newly cloned both fm-eval and unitxt, and rebuilt the envs. For me, when I try running the basic run_text2text.py...

sam-data-guy-iam

Add plugin operators

3

This is a proposal for a new behaviour that allows to change card operators from the final command such that: `load_dataset("card=cards.wikitq,table_serializer=serializers.table.markdown")` will be loading the wikitq with different table serializer,...

elronbandel

Seed control in unitxt

1

Today, unitxt uses a default seed (42) for all dataset. It's not actually possible to change the seed today. Changing the seed could effect the dataset significantly given random choices,...

yoavkatz

unitxt
unitxt copied to clipboard

Metadata

Suggestion: Add notion of SubTask

Add FunctionRecipe that utilze the standard recipe to create a function that can be used to parse data at inference.

Add to all type checking error messages the wrong type it got.

Add DynamicFormat that uses different SystemFormat based on size of demos list and existence of instruction.

new format and system prompt for MtBench

Handling sensitive data sent to remote services

Gather all string operators in one standard file

possible full import needed in unitxt.dataset

Add plugin operators

Seed control in unitxt

← Metadata

Owner

Metadata

unitxt unitxt copied to clipboard

Metadata

← Metadata

Owner

Metadata

unitxt
unitxt copied to clipboard