ChatProtect
A few Q about the dataset architecture of this repository
Hi, congrats on your excellent work! It helps me a lot! However, I have a few questions about the dataset architecture of this repository:
- Is ChatProtect/test/test the MainTestSet in the paper?
- For example, in the path ChatProtect/test/test/sentences/chatgpt/Please_tell_me_about_199091_Austrian_Hockey_League_season/, what is the meaning of m0, m1, m2 and m3?
- In the same path, take the file 0_9_2.txt as an example: what is the meaning of these three numbers?
- Lastly, could you give a specific explanation of the keys in the txt files, like "tag", "wrong" and "orig_tag"?
Hi, thanks for your comment.
- Yes, test/test is the MainTestSet in the paper.
- m0 to m3 refer to the different methods for creating an alternative sentence for a sentence pair, as described in Appendix B of the paper ("Ablation Study on Trigger"): m0 is i) Continue, m1 is ii) Rephrase, m2 is iii) Q&A, and m3 is our method.
- The numbering for sentence files is `<num_description>_<num_sentence>_<num_generated_alternative>` (all indices starting at 0). For example, `0_9_2` is the
  - third (2) alternative sentence for the
  - tenth (9) sentence of the
  - first (0) description
- The meaning is as follows:
  - `tag`: The ground-truth label for whether the two sentences are considered self-contradictory according to the annotation procedure described in the paper. If no ground truth was annotated (i.e., in `test_big`), this is the annotation generated by the aLM.
  - `orig_tag`: (Optional) Stores the annotation for the sentence pair as generated by the aLM specified during generation. It can be ignored, as it is usually not used / overwritten by the annotation in `output/test`.
  - `wrong`: (Optional) Which of the two sentences was considered factually incorrect by annotators, either `none`, `1`, `2` or `both`. Note that if the first sentence (which may occur in multiple files) is considered incorrect in any file, it overrides the annotation in other files. For example, if `0_9_0` contains `wrong = 1` and `0_9_1` and `0_9_2` both contain `wrong = 2`, then the first sentence is considered factually incorrect in `0_9_1` and `0_9_2` as well, which is equivalent to writing `wrong = both` in those two files.
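To make the file naming and the `wrong` override rule concrete, here is a minimal Python sketch. It is not code from the repository: the names `SentenceFileId`, `parse_sentence_filename` and `resolve_wrong` are hypothetical, and it assumes the `wrong` values have already been read from the files into a dictionary. It also assumes that files sharing the same description and sentence indices share the same first sentence, as the example above suggests.

```python
from typing import Dict, NamedTuple


class SentenceFileId(NamedTuple):
    """Indices encoded in a sentence file name (all starting at 0)."""
    description: int
    sentence: int
    alternative: int


def parse_sentence_filename(name: str) -> SentenceFileId:
    """Split e.g. '0_9_2.txt' into (description, sentence, alternative)."""
    stem = name[:-4] if name.endswith(".txt") else name
    description, sentence, alternative = (int(part) for part in stem.split("_"))
    return SentenceFileId(description, sentence, alternative)


def resolve_wrong(annotations: Dict[str, str]) -> Dict[str, str]:
    """Apply the override rule for the shared first sentence.

    `annotations` maps a file name to its `wrong` value
    ('none', '1', '2' or 'both'). Assumption: files with the same
    description and sentence indices share the same first sentence.
    """
    # Collect every (description, sentence) whose first sentence is
    # marked incorrect in at least one file.
    first_is_wrong = set()
    for name, wrong in annotations.items():
        file_id = parse_sentence_filename(name)
        if wrong in ("1", "both"):
            first_is_wrong.add((file_id.description, file_id.sentence))

    # Propagate that verdict to all files sharing the first sentence.
    resolved = {}
    for name, wrong in annotations.items():
        file_id = parse_sentence_filename(name)
        if (file_id.description, file_id.sentence) in first_is_wrong:
            wrong = {"none": "1", "2": "both"}.get(wrong, wrong)
        resolved[name] = wrong
    return resolved


if __name__ == "__main__":
    print(parse_sentence_filename("0_9_2.txt"))
    print(resolve_wrong({"0_9_0": "1", "0_9_1": "2", "0_9_2": "2"}))
```

With the example from the answer, `0_9_1` and `0_9_2` both resolve to `both`, while `0_9_0` stays at `1`.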
I hope this information helps in understanding the architecture of the repository. Please feel free to reach out if any questions remain.