multicite icon indicating copy to clipboard operation
multicite copied to clipboard

Do the paper_id in MULTICITE can link to according paper in S2ORC?

Open HongJinTsai opened this issue 4 years ago • 7 comments

Hi! Thanks for proposing such an interesting work ;) I wonder that whether we can use the paper_id in this dataset to find the according paper in the S2ORC? Because I think using the full text or other information of the cited paper may be helpful for my work, it would be great if I can use both of dataset at the same time. Thanks :)

HongJinTsai avatar Jul 13 '21 06:07 HongJinTsai

It seems that the paper ID and sentence IDs are derived from the _pdf_hash attribute in the pdf_parse of S2ORC dataset. However, the detailed rule cannot be inferred easily.

@kyleclo Can you also provide the mapping between the intent_id and the actual intent? Thank you.

jacklxc avatar Jul 21 '21 01:07 jacklxc

Yes, sorry for the delay. I'm uploading a revised version of this shortly w/ the proper IDs / mappings. Thanks for catching this

kyleclo avatar Jul 21 '21 20:07 kyleclo

@kyleclo This is a reminder for uploading the dataset with proper IDs. Thanks!

jacklxc avatar Jul 30 '21 16:07 jacklxc

@kyleclo This is a reminder for uploading the dataset with proper IDs. Thanks!

afei8178 avatar Sep 18 '21 13:09 afei8178

@kyleclo I've been waiting. Thanks!

afei8178 avatar Sep 18 '21 13:09 afei8178

@kyleclo How to use paper_id in full-v20210918.json? Is there a mapping table now?

pcchen-ntunlp avatar Aug 16 '22 05:08 pcchen-ntunlp

@kyleclo Can you please provide the mapping of paper ids in the dataset to the papers IDs in S2ORC dataset.

ManasiPat avatar Jul 21 '23 06:07 ManasiPat