PETER icon indicating copy to clipboard operation
PETER copied to clipboard

询问数据集中的问题

Open FancyXie opened this issue 1 year ago • 4 comments

你好,看了你的PETER论文很受启发。我想拿另外领域的数据集来做对比实验,需要把数据改成你的这种形式。请问TripAdvisor数据集文件夹下面的1里面的train.index是什么意思?reviews.pickle里面的记录,比如[{'user': '87397110FAD35B2CA4419D2892904CE3', 'item': '1068719', 'rating': 4, 'template': ('hospitality', 'one', 'the w is trying too hard to have style and be hip and this erodes a bit of the asian hospitality one has come to expect in hong kong', 1), 'predicted': 'apps'}]里面的template的每一个是什么意思呢?predicted是什么意思呢?

FancyXie avatar Oct 31 '23 13:10 FancyXie

数据集链接里的README里有详细介绍,请查看

lileipisces avatar Oct 31 '23 18:10 lileipisces

你好,我看了数据集readme的介绍。但是还有一些疑问希望您能帮忙解答一下。'template': ('hospitality', 'one', 'the w is trying too hard to have style and be hip and this erodes a bit of the asian hospitality one has come to expect in hong kong', 1), 'predicted': 'apps'},这个template里面的(feature, adjective, sentence, sentiment),以及predicted,分别是经过什么处理之后得到的呢? 因为JSON格式的文件里貌似不包括这些信息。

FancyXie avatar Nov 27 '23 02:11 FancyXie

你看一下我们CIKM20那篇paper的数据处理部分吧,我github上也有数据处理的repo (sentires-guide)

lileipisces avatar Nov 27 '23 02:11 lileipisces

好的,非常感谢你!

FancyXie avatar Nov 27 '23 02:11 FancyXie