UCTopic
An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); code base for the ACL 2022 paper UCTopic: Unsupervised Contrastive Learning for Phrase Representations and Topic...
In Section 4.3 (Topical Phrase Mining), spaCy was used for dataset construction. Could you provide the processed datasets (Gest, KP20k, KPTimes) with the annotated phrases? Thank you.
Take the example sentence "Allie drove to Boston for a meeting." When I pretrain UCTopic, the model takes input_ids as [0, 50264, 324, 4024, 7, 2278, 13, 10, 529, 4,...