ASOTE icon indicating copy to clipboard operation
ASOTE copied to clipboard

How can be re-used in a totally new and different dataset?

Open Jurys22 opened this issue 2 years ago • 1 comments

Do the users need to annotate the dataset for training,testing,evaluation? or do the users can re-use already annotated datasets available?

I would like to test this repo against tweets - would it be work without data annotation?

Thank you for the open source code!

Jurys22 avatar Mar 25 '22 12:03 Jurys22

Do the users need to annotate the dataset for training,testing,evaluation? or do the users can re-use already annotated datasets available?

I would like to test this repo against tweets - would it be work without data annotation?

Thank you for the open source code!

Thanks for your interest in our work.

  1. If you have an annotated tweet dataset, you can directly use the codes in this repo on the dataset. The simplest way to test this repo on the tweet dataset have three steps: (1) Convert the tweet dataset and make it have the same format as the datasets used in this repo, for example, https://github.com/l294265421/ASOTE/tree/main/ASOTE-data/absa/ASOTE-v2/rest14/asote_gold_standard. (2) Replace the rest14 dataset (https://github.com/l294265421/ASOTE/tree/main/ASOTE-data/absa/ASOTE-v2/rest14/asote_gold_standard) with the tweet dataset. (3) Run the commands related to the rest14 dataset in PBF section of README (https://github.com/l294265421/ASOTE). Then, you can obtatin the model peformance on your tweet dataset.
  2. If your tweet dataset has not been annotated, you can train our model on the datsets provided in this repo, then extract opinion triplets from the tweet dataset using the trained model (cross-domain opinion triplet extraction). To achieve this, you can also follow the three steps above, but only replace the test dataset with your tweet data.

l294265421 avatar Mar 25 '22 13:03 l294265421