
Description for the attributes in the data file

Open icedpanda opened this issue 2 years ago • 0 comments

Hi @czyssrs, thanks for your excellent work.

I could only find the following descriptions in the README file:

"pre_text": the texts before the table;
"post_text": the text after the table;
"table": the table;
"id": unique example id. composed by the original report name plus example index for this report. 

"qa": {
  "question": the question;
  "program": the reasoning program;
  "gold_inds": the gold supporting facts;
  "exe_ans": the gold execution result;
  "program_re": the reasoning program in nested format;
}

However, after walking through the actual train.json file, I found many other attributes, such as table_ori, table_retrieved, and text_retrieved. Would it be possible to add a description for each attribute in the dataset?
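For reference, here is a minimal sketch of how the undocumented attributes could be listed. It assumes train.json is a JSON list of objects, each with a nested "qa" dict, as the README excerpt suggests. The helper names and the sample record are hypothetical, and the placement of table_ori at the top level is an assumption made only for illustration.

```python
import json

# Attributes described in the README excerpt quoted above.
DOCUMENTED_TOP = {"pre_text", "post_text", "table", "id", "qa"}
DOCUMENTED_QA = {"question", "program", "gold_inds", "exe_ans", "program_re"}


def load_examples(path):
    """Load the dataset, assuming the file holds a JSON list of examples."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def undocumented_keys(examples):
    """Return (top-level keys, qa keys) that the README does not describe."""
    extra_top, extra_qa = set(), set()
    for ex in examples:
        extra_top |= set(ex) - DOCUMENTED_TOP
        qa_block = ex.get("qa", {})
        if isinstance(qa_block, dict):
            extra_qa |= set(qa_block) - DOCUMENTED_QA
    return sorted(extra_top), sorted(extra_qa)


# Made-up record for illustration only; not taken from train.json.
sample = [{
    "id": "hypothetical-0",
    "pre_text": [], "post_text": [], "table": [],
    "table_ori": [],  # assumed to sit at the top level
    "qa": {
        "question": "?", "program": "", "gold_inds": {},
        "exe_ans": 0.0, "program_re": "",
        "model_input": [], "tfidftopn": {},
    },
}]

extra_top, extra_qa = undocumented_keys(sample)
print("undocumented top-level keys:", extra_top)
print("undocumented qa keys:", extra_qa)
```

With the real data, `undocumented_keys(load_examples("train.json"))` would give the full list of attributes that still need a description.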

Also:

  1. Aren't text_retrieved and table_retrieved, with their corresponding scores, generated by a model?
  2. What are model_input and tfidftopn under qa?
  3. answer and exe_ans are different for Sample 1 ("filename": "ADI/2009/page_49.pdf") in train.json:
    • answer:380, exe_ans:3.8
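A rough sketch of how such mismatches could be found across the whole file is below. It assumes that answer and exe_ans both live under qa, that filename is a top-level key, and that answer may be a string containing "$", "%", or thousands separators; none of this is confirmed by the README. The function names and the second sample record are hypothetical.

```python
import math


def parse_number(text):
    """Best-effort parse of values such as "380", "3.8%", or "$1,200"."""
    cleaned = str(text).replace(",", "").replace("$", "").replace("%", "").strip()
    try:
        return float(cleaned)
    except ValueError:
        return None  # non-numeric answers (e.g. "yes") are skipped by the caller


def find_mismatches(examples, rel_tol=1e-3):
    """Return (filename, answer, exe_ans) for examples whose two values differ."""
    mismatches = []
    for ex in examples:
        qa = ex.get("qa", {})
        answer = parse_number(qa.get("answer"))
        exe_ans = parse_number(qa.get("exe_ans"))
        if answer is None or exe_ans is None:
            continue
        if not math.isclose(answer, exe_ans, rel_tol=rel_tol):
            mismatches.append((ex.get("filename"), answer, exe_ans))
    return mismatches


sample = [
    # Values reported above for Sample 1.
    {"filename": "ADI/2009/page_49.pdf", "qa": {"answer": "380", "exe_ans": 3.8}},
    # Made-up consistent record for contrast.
    {"filename": "hypothetical/page_1.pdf", "qa": {"answer": "3.8", "exe_ans": 3.8}},
]

mismatches = find_mismatches(sample)
print(mismatches)
```

This only flags numeric disagreements; it does not say which of the two values is correct, or whether a factor-of-100 difference reflects a percentage convention.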

Thanks for your help and support.

icedpanda, May 17 '22 05:05