grover icon indicating copy to clipboard operation
grover copied to clipboard

Input format

Open sjoerd1999 opened this issue 4 years ago • 0 comments

When generating an article using contexual_generate.py, you need to specify a list of variables in a .jsonl file(from what I deducted from the 'april2019_set_mini.jsonl' file): "title", "text", "summary", "authors", "publish-date", "url", "domain", "warc_date", "split" and "inst_index".

Which of these variables actually influence the generation of articles and in what way? I know "title" is the heading, and will be the main topic of the generated article, but do the other variables also contribute to the generation? Also I can't find what "warc_date" and "inst_index" entail.

Thanks in advance!

sjoerd1999 avatar Nov 20 '20 15:11 sjoerd1999