janmaltel
Oh, that is great! Thank you very much for your answer. I would still advocate for a general toggle/option for anonymization. At our university, the student submissions we receive are...
I don't yet understand how to use https://github.com/huggingface/datasets/pull/5580 to call `load_dataset(data_files="s3://...")`. Any help or example would be much appreciated :) Thanks!
> Might adapt to multi-GPU in a bit to speed up training.

Could you point me to the reason why this is not working on multiple GPUs? I.e., which part is...