uda
uda copied to clipboard
releasing other text classification models, datasets & unlabeled corpus
Hi, thanks for releasing the paper & code.
I have tried the IMDb text classification task and UDA achieved quite promising improvements. Will you release your models and datasets for the other text classification tasks, especially the unlabeled corpus? So that your work will be easier to follow and have a larger impact.
Thanks
Hi, you can directly use the current code for other datasets and we used similar hyperparameters for them. You can get the supervised data for other datasets from here. The labeled examples for semi-supervised learning are chosen randomly. Here is the unsupervised data for Yelp and Amazon.
Thanks! That's awesome!
Thanks for your reply. Another little question. How many unlabeled data did you use for Yelp and Amazon reviews? I suppose that you used all unlabeled reviews according to your description in the paper.
Thanks!
Yes, we used all unlabeled reviews. Sorry for the late reply!