Poetry-related datasets developed by the THUAIPoet (Jiuge) group.

THUAIPoet Datasets

This repository provides datasets developed by the THUAIPoet (九歌, Jiuge) group at the Research Center for Natural Language Processing, Computational Humanities and Social Sciences, Tsinghua University. Note that all of our datasets are released for academic use only.

We will keep improving the existing datasets and will release more datasets in the future. Any suggestions are welcome!

Dataset                                                   Version
THU Poetry Quality Evaluation DataSet (THU-PQED)          V0.1
THU Fine-grained Sentimental Poetry Corpus (THU-FSPC)     V1.0
THU Chinese Classical Poetry Corpus (THU-CCPC)            V1.0
THU Chinese Rhythm and Rhyme Data (THU-CRRD)              V0.1