
🦄 Unitxt: a Python library for getting data fired up and set for training and evaluation

Results 201 unitxt issues

Currently, demo examples are selected from the demo pool (which is typically extracted from the train set). The selection of demos per instance is random by default, or more complex...

Today the code in formats.py assumes the target prefix for the demos is taken from the instance. This may cause subtle issues when the target_prefix uses an input field for formatting...

The code below will call a full-dataset metric computation for each row - i.e. the work is duplicated num_row times. This is a big issue for huggingface metrics, such as...
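The code this issue refers to is not shown in the preview, so the following is only a minimal sketch of the general pattern it describes, not unitxt's implementation. The names `score_per_row_naive`, `score_once_and_broadcast`, and `CountingAccuracy` are hypothetical and exist only for illustration. The sketch contrasts calling a full-dataset metric once per row with calling it once and reusing the result for every row.

```python
from typing import Callable, Dict, List

# Hypothetical type for a metric that scores a whole dataset at once.
GlobalMetric = Callable[[List[str], List[str]], Dict[str, float]]


class CountingAccuracy:
    """Toy full-dataset metric that records how often it is invoked."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, predictions: List[str], references: List[str]) -> Dict[str, float]:
        self.calls += 1
        correct = sum(p == r for p, r in zip(predictions, references))
        return {"accuracy": correct / len(predictions)}


def score_per_row_naive(
    predictions: List[str], references: List[str], metric: GlobalMetric
) -> List[Dict[str, float]]:
    # Problematic pattern: the full-dataset metric runs once for every row.
    return [metric(predictions, references) for _ in predictions]


def score_once_and_broadcast(
    predictions: List[str], references: List[str], metric: GlobalMetric
) -> List[Dict[str, float]]:
    # Compute the full-dataset score a single time, then attach a copy to each row.
    result = metric(predictions, references)
    return [dict(result) for _ in predictions]


if __name__ == "__main__":
    preds = ["a", "b", "c", "d"]
    refs = ["a", "b", "x", "d"]

    naive_metric = CountingAccuracy()
    score_per_row_naive(preds, refs, naive_metric)
    print("naive metric calls:", naive_metric.calls)

    fast_metric = CountingAccuracy()
    score_once_and_broadcast(preds, refs, fast_metric)
    print("broadcast metric calls:", fast_metric.calls)
```

Both functions return the same per-row scores; the difference is only in how many times the expensive metric is evaluated, which matters most when a single evaluation is costly.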

Adding Bluebench, including the arena-hard

Moutté

Hi, I noticed that in some templates of MMLU, it seems that the part of the prompt "The following are multiple choice questions (with answers) about {topic}." is being placed...

I recently encountered an issue where a change in the code, specifically modifying the check from `if key == "type"` to `if key == "__type__"`, caused my locally saved data...

(1) Separate the structured representation and make it available in a dedicated field for people who want to use it externally (e.g. for using the OpenAI API) (2) change existing formats...