cbert_aug icon indicating copy to clipboard operation
cbert_aug copied to clipboard

More details about how SST-2 is prepared

Open kaniblu opened this issue 5 years ago • 0 comments

The SST-2 dataset included in the repo contains 6,228 training samples, 692 validation samples, and 1821 test samples. But the official SST-2 dataset (which can be access via torchtext) contains 6,920 training binary-class samples, and 872 validation binary-class samples. What gives? Could you clarify the discrepancy?

kaniblu avatar Jul 30 '20 13:07 kaniblu