transformerlab-app

More flexible custom dataset file structures

Open dadmobile opened this issue 1 year ago • 0 comments

Our underlying code uses Hugging Face's load_dataset, which allows flexible file structures for custom local datasets:

https://huggingface.co/docs/hub/en/datasets-file-names-and-splits

But our app and API code force the user into one very specific layout: exactly one file each of <dataset_id>[train|eval].jsonl.
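As a rough illustration of what a more flexible layout could look like, here is a minimal sketch that scans a local dataset directory and groups files into splits by keyword. The helper name `discover_data_files`, the keyword table, and the default-to-train fallback are assumptions for illustration, loosely modeled on the naming conventions in the linked Hugging Face page; this is not the app's current code, nor the library's own resolution logic.

```python
import re
from collections import defaultdict
from pathlib import Path
from typing import Dict, List, Optional

# Split keywords, loosely modeled on the Hugging Face naming conventions.
# This table is an assumption for illustration, not the library's own logic.
SPLIT_KEYWORDS = {
    "train": ("train", "training"),
    "validation": ("validation", "valid", "val", "dev"),
    "test": ("test", "testing", "eval", "evaluation"),
}


def _split_for(rel_path: Path) -> Optional[str]:
    """Return the split whose keyword appears as a token in the path, if any."""
    # Tokenize each directory name and the file stem on common separators.
    parts = list(rel_path.parts[:-1]) + [rel_path.stem]
    tokens = {
        tok
        for part in parts
        for tok in re.split(r"[-_.\s]+", part.lower())
        if tok
    }
    for split, keywords in SPLIT_KEYWORDS.items():
        if tokens & set(keywords):
            return split
    return None


def discover_data_files(
    dataset_dir, extensions=(".jsonl", ".json", ".csv")
) -> Dict[str, List[str]]:
    """Map split names to the data files found under dataset_dir (hypothetical helper)."""
    root = Path(dataset_dir)
    found = defaultdict(list)
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix.lower() in extensions:
            # Files with no recognizable keyword fall back to the train split.
            split = _split_for(path.relative_to(root)) or "train"
            found[split].append(str(path))
    return dict(found)
```

The resulting mapping could then be handed to load_dataset through its data_files argument, for example `load_dataset("json", data_files=discover_data_files(path))` for JSON Lines files, so that several files per split and nested directories would both be accepted.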

dadmobile, Feb 28 '24 19:02