text-anonymization-benchmark
Create a HuggingFace dataset for TAB
Hi,
The TAB dataset and evaluation approach are amazing! For anyone interested in training models on this dataset, it would be very useful to have it available as a HuggingFace dataset.
Would this be something you'd consider?
Definitely, good suggestion! I'll add it to our todo list :-)
I actually did this a few weeks ago, as I'm gonna use the dataset for some PhD work:
https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-train
https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-val-test
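For anyone who wants to try the two repositories linked above, here is a minimal sketch of how they could be loaded with the `datasets` library. The names `TAB_REPOS` and `load_tab` are just made up for illustration, and I'm not assuming anything about split or column names, so the function simply returns whatever `load_dataset` gives back for the repository.

```python
# Repository ids copied from the two links above.
TAB_REPOS = {
    "train": "mattmdjaga/text-anonymization-benchmark-train",
    "val_test": "mattmdjaga/text-anonymization-benchmark-val-test",
}


def load_tab(part: str):
    """Download one of the two repositories from the HuggingFace Hub.

    `part` is a key of TAB_REPOS (hypothetical helper naming, not part of TAB).
    No split is requested, so the result is whatever `load_dataset` returns
    for the whole repository; inspect it to see the available splits.
    """
    # Imported lazily so this file can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(TAB_REPOS[part])


# Example (needs `pip install datasets` and network access):
#   train = load_tab("train")
#   print(train)  # shows the splits and columns actually present
```

Printing the returned object first is probably the safest way to check which splits and columns are there before writing any training code against them.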