unitxt
unitxt copied to clipboard
🦄 Unitxt: a python library for getting data fired up and set for training and evaluation
Currently demo examples are selected from the demo pool (which is typically extracted from the train set). The selection of demos per instance is random by default, or more complex...
Today the code in formats.py assumes the target prefix for the demos is taken from the instance. This may cause subtle issue when the target_prefix uses input field for formatting...
The code below will call a full-dataset metric computation for each row - i.e. the work is duplicated num_row times. This is a big issue for huggingface metrics, such as...
Hi, I noticed that in some templates of MMLU, it seems that the part of the prompt "The following are multiple choice questions (with answers) about {topic}." is being placed...
I recently encountered an issue where a change in the code, specifically modifying the check from `if key == "type"` to `if key == "__type__"`, caused my locally saved data...
1) Seperate the structured representation and make it available at a dedicated field for people want to use it externally (e.g for using open ai api) (2) change existing formats...