text-anonymization-benchmark

Create a HuggingFace dataset for TAB

Open omri374 opened this issue 2 years ago • 2 comments

Hi,

The TAB dataset and evaluation approach are amazing! For those interested in training models on this dataset, it would be very useful to have it available as a HuggingFace dataset.

Would this be something you'd consider?

omri374, Apr 08 '22 16:04

Definitely, good suggestion! I'll add it to our todo list :-)

plison, Apr 12 '22 12:04

I actually did this a few weeks ago, as I'm going to use the dataset for some PhD work:
https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-train
https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-val-test

mattmdjaga, May 06 '24 11:05
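
For anyone landing on this thread, here is a minimal sketch of how the two community uploads linked above might be loaded with the Hugging Face `datasets` library. It assumes both repositories are public and in a layout that `load_dataset` can read directly; the split and column names are not stated in this thread, so the sketch does not assume any. The helper names `repo_id_from_url` and `load_tab` are hypothetical, chosen only for illustration.

```python
from urllib.parse import urlparse

# URLs copied verbatim from the comment above.
TAB_URLS = [
    "https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-train",
    "https://huggingface.co/datasets/mattmdjaga/text-anonymization-benchmark-val-test",
]


def repo_id_from_url(url: str) -> str:
    """Turn a huggingface.co/datasets/<user>/<name> URL into '<user>/<name>'."""
    parts = urlparse(url).path.strip("/").split("/")
    if len(parts) != 3 or parts[0] != "datasets":
        raise ValueError(f"not a Hugging Face dataset URL: {url}")
    return "/".join(parts[1:])


def load_tab():
    """Download both uploads and return them keyed by repository id.

    Assumption: the repos load with a plain load_dataset(repo_id) call.
    Requires `pip install datasets` and network access.
    """
    # Imported lazily so the URL helper works without the library installed.
    from datasets import load_dataset

    return {
        repo_id: load_dataset(repo_id)
        for repo_id in map(repo_id_from_url, TAB_URLS)
    }
```

Calling `load_tab()` and printing each returned value should list whatever splits and columns the uploads actually contain, which is worth checking before writing any training code against them.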