EditNTS icon indicating copy to clipboard operation
EditNTS copied to clipboard

How do I get the files 'val.df.filtered.pos' and 'train.df.filtered.pos

Open oskrmiguel opened this issue 4 years ago • 2 comments

Hello It is very interesting your work, I have a question, how do you get these files that you name in main.py with these extensions 'val.df.filtered.pos' and 'train.df.filtered.pos'; or where can I get them?

oskrmiguel avatar Mar 02 '20 22:03 oskrmiguel

Hi,

Most of the code is in data_preprocess.py, the filtered files remove the rows where the source and the target are identical.

YueDongCS avatar Mar 08 '20 20:03 YueDongCS

Hello, I tried to report your work but I didn't succeed, I have a question on this subject: "In the paper, we filtered out the rows where the source sentence and the target sentence are identical to encourage editing, you can do this by adding a line at line 41 in data_processing.py:

comp_txt,simp_txt=unzip([(i[0],i[1]) for i in zip(comp_txt,simp_txt)] if i[0] != i[1]])."

Apparently the line of code you mention has an error in it that is this imagen

Another question, where do I find the unzip function?

oskrmiguel avatar Apr 16 '20 19:04 oskrmiguel