RL-based-Graph2Seq-for-NQG icon indicating copy to clipboard operation
RL-based-Graph2Seq-for-NQG copied to clipboard

Question: which input fields are actually required for the model?

Open andrei-volkau opened this issue 3 years ago • 1 comments

I am sending an example.

I trained a model. I would like to try it on my own data. Could you please clarify which fields are actually required for the model? It might be possible that I don't need to provide all of them while shaping my own data as per the format below.

{
  "text3": "Twins",
  "text1": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
  "text2": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
  "annotation3": {
    "raw_text": "Twins",
    "toks": "Twins",
    "POSs": "NNS",
    "positions": "0-0-5",
    "NERs": "O"
  },
  "id": "56de8c374396321400ee2a11",
  "annotation2": {
    "raw_text": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
    "toks": "What 's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988 ?",
    "POSs": "WP VBZ DT NN IN DT NN NN NNP VBD IN IN NNP NNP IN CD .",
    "positions": "0-0-4 1-4-6 2-7-10 3-11-16 4-17-19 5-20-23 6-24-30 7-31-36 8-37-51 9-52-59 10-60-62 11-63-67 12-68-73 13-74-80 14-81-83 15-84-88 16-88-89",
    "NERs": "O O O O O O O O PERSON O O O PERSON PERSON O DATE O"
  },
  "annotation1": {
    "raw_text": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "toks": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "POSs": "NNS JJ CD NN , DT NN IN JJ NN , RB VBD JJ .",
    "positions": "0-0-5 1-6-11 2-12-16 3-17-22 4-23-24 5-25-26 6-27-33 7-34-38 8-39-44 9-45-51 10-52-53 11-54-58 12-59-65 13-66-76 14-77-78",
    "NERs": "O O DATE O O O O O O O O O O O O",
    "graph": {
      "g_features": [
        "twins",
        "-lrb-",
        "1988",
        "-rrb-",
        ",",
        "a",
        "comedy",
        "with",
        "danny",
        "devito",
        ",",
        "also",
        "proved",
        "successful",
        "."
      ],
      "g_adj": {
        "0": [
          {
            "node": 1,
            "edge": "punct"
          },
          {
            "node": 2,
            "edge": "appos"
          },
          {
            "node": 3,
            "edge": "punct"
          },
          {
            "node": 4,
            "edge": "punct"
          },
          {
            "node": 6,
            "edge": "appos"
          },
          {
            "node": 10,
            "edge": "punct"
          }
        ],
        "6": [
          {
            "node": 5,
            "edge": "det"
          },
          {
            "node": 7,
            "edge": "prep"
          }
        ],
        "7": [
          {
            "node": 9,
            "edge": "pobj"
          }
        ],
        "9": [
          {
            "node": 8,
            "edge": "amod"
          }
        ],
        "12": [
          {
            "node": 0,
            "edge": "nsubj"
          },
          {
            "node": 11,
            "edge": "advmod"
          },
          {
            "node": 13,
            "edge": "oprd"
          },
          {
            "node": 14,
            "edge": "punct"
          }
        ]
      },
      "num_edges": 14
    }
  }
}

andrei-volkau avatar Oct 20 '20 13:10 andrei-volkau

I am sending an example.

I trained a model. I would like to try it on my own data. Could you please clarify which fields are actually required for the model? It might be possible that I don't need to provide all of them while shaping my own data as per the format below.

{
  "text3": "Twins",
  "text1": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
  "text2": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
  "annotation3": {
    "raw_text": "Twins",
    "toks": "Twins",
    "POSs": "NNS",
    "positions": "0-0-5",
    "NERs": "O"
  },
  "id": "56de8c374396321400ee2a11",
  "annotation2": {
    "raw_text": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
    "toks": "What 's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988 ?",
    "POSs": "WP VBZ DT NN IN DT NN NN NNP VBD IN IN NNP NNP IN CD .",
    "positions": "0-0-4 1-4-6 2-7-10 3-11-16 4-17-19 5-20-23 6-24-30 7-31-36 8-37-51 9-52-59 10-60-62 11-63-67 12-68-73 13-74-80 14-81-83 15-84-88 16-88-89",
    "NERs": "O O O O O O O O PERSON O O O PERSON PERSON O DATE O"
  },
  "annotation1": {
    "raw_text": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "toks": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
    "POSs": "NNS JJ CD NN , DT NN IN JJ NN , RB VBD JJ .",
    "positions": "0-0-5 1-6-11 2-12-16 3-17-22 4-23-24 5-25-26 6-27-33 7-34-38 8-39-44 9-45-51 10-52-53 11-54-58 12-59-65 13-66-76 14-77-78",
    "NERs": "O O DATE O O O O O O O O O O O O",
    "graph": {
      "g_features": [
        "twins",
        "-lrb-",
        "1988",
        "-rrb-",
        ",",
        "a",
        "comedy",
        "with",
        "danny",
        "devito",
        ",",
        "also",
        "proved",
        "successful",
        "."
      ],
      "g_adj": {
        "0": [
          {
            "node": 1,
            "edge": "punct"
          },
          {
            "node": 2,
            "edge": "appos"
          },
          {
            "node": 3,
            "edge": "punct"
          },
          {
            "node": 4,
            "edge": "punct"
          },
          {
            "node": 6,
            "edge": "appos"
          },
          {
            "node": 10,
            "edge": "punct"
          }
        ],
        "6": [
          {
            "node": 5,
            "edge": "det"
          },
          {
            "node": 7,
            "edge": "prep"
          }
        ],
        "7": [
          {
            "node": 9,
            "edge": "pobj"
          }
        ],
        "9": [
          {
            "node": 8,
            "edge": "amod"
          }
        ],
        "12": [
          {
            "node": 0,
            "edge": "nsubj"
          },
          {
            "node": 11,
            "edge": "advmod"
          },
          {
            "node": 13,
            "edge": "oprd"
          },
          {
            "node": 14,
            "edge": "punct"
          }
        ]
      },
      "num_edges": 14
    }
  }
}

@andrei-volkau sorry for the late reply! Actually all the fields except for "positions" are needed. If you do not want to use the linguistic features like POS and NER, these two fields can be omitted as well.

hugochan avatar Mar 02 '21 14:03 hugochan