RL-based-Graph2Seq-for-NQG
Question: which input fields are actually required for the model?
An example record is included below.
I trained a model and would like to try it on my own data. Could you please clarify which fields the model actually requires? I may not need to provide all of them when shaping my own data in the format below.
{
"text3": "Twins",
"text1": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
"text2": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
"annotation3": {
"raw_text": "Twins",
"toks": "Twins",
"POSs": "NNS",
"positions": "0-0-5",
"NERs": "O"
},
"id": "56de8c374396321400ee2a11",
"annotation2": {
"raw_text": "What's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988?",
"toks": "What 's the title of the comedy movie Schwarzenegger starred in with Danny DeVito in 1988 ?",
"POSs": "WP VBZ DT NN IN DT NN NN NNP VBD IN IN NNP NNP IN CD .",
"positions": "0-0-4 1-4-6 2-7-10 3-11-16 4-17-19 5-20-23 6-24-30 7-31-36 8-37-51 9-52-59 10-60-62 11-63-67 12-68-73 13-74-80 14-81-83 15-84-88 16-88-89",
"NERs": "O O O O O O O O PERSON O O O PERSON PERSON O DATE O"
},
"annotation1": {
"raw_text": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
"toks": "twins -lrb- 1988 -rrb- , a comedy with danny devito , also proved successful .",
"POSs": "NNS JJ CD NN , DT NN IN JJ NN , RB VBD JJ .",
"positions": "0-0-5 1-6-11 2-12-16 3-17-22 4-23-24 5-25-26 6-27-33 7-34-38 8-39-44 9-45-51 10-52-53 11-54-58 12-59-65 13-66-76 14-77-78",
"NERs": "O O DATE O O O O O O O O O O O O",
"graph": {
"g_features": [
"twins",
"-lrb-",
"1988",
"-rrb-",
",",
"a",
"comedy",
"with",
"danny",
"devito",
",",
"also",
"proved",
"successful",
"."
],
"g_adj": {
"0": [
{
"node": 1,
"edge": "punct"
},
{
"node": 2,
"edge": "appos"
},
{
"node": 3,
"edge": "punct"
},
{
"node": 4,
"edge": "punct"
},
{
"node": 6,
"edge": "appos"
},
{
"node": 10,
"edge": "punct"
}
],
"6": [
{
"node": 5,
"edge": "det"
},
{
"node": 7,
"edge": "prep"
}
],
"7": [
{
"node": 9,
"edge": "pobj"
}
],
"9": [
{
"node": 8,
"edge": "amod"
}
],
"12": [
{
"node": 0,
"edge": "nsubj"
},
{
"node": 11,
"edge": "advmod"
},
{
"node": 13,
"edge": "oprd"
},
{
"node": 14,
"edge": "punct"
}
]
},
"num_edges": 14
}
}
}
@andrei-volkau Sorry for the late reply! All of the fields except "positions" are needed. If you do not want to use linguistic features such as POS and NER, the "POSs" and "NERs" fields can be omitted as well.
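For anyone shaping their own data, here is a minimal sketch of what that answer means for a record. The helper `strip_optional_fields` and its `keep_linguistic` parameter are hypothetical names, not part of the repository; the field names come from the example in the question. Whether the data loader accepts records without "POSs" and "NERs" probably depends on the linguistic features being turned off in the config, which is an assumption here.

```python
import copy

# Annotation keys as they appear in the example record from the question.
ANNOTATION_KEYS = ("annotation1", "annotation2", "annotation3")


def strip_optional_fields(record, keep_linguistic=True):
    """Return a copy of `record` without the fields described as optional.

    Hypothetical helper, not part of the repository. "positions" is always
    dropped; "POSs" and "NERs" are dropped only when keep_linguistic is False.
    """
    drop = {"positions"}
    if not keep_linguistic:
        drop |= {"POSs", "NERs"}

    slim = copy.deepcopy(record)  # leave the caller's record untouched
    for key in ANNOTATION_KEYS:
        annotation = slim.get(key)
        if isinstance(annotation, dict):
            for field in drop:
                annotation.pop(field, None)
    return slim


# Trimmed-down version of the record from the question.
example = {
    "id": "56de8c374396321400ee2a11",
    "text3": "Twins",
    "annotation3": {
        "raw_text": "Twins",
        "toks": "Twins",
        "POSs": "NNS",
        "positions": "0-0-5",
        "NERs": "O",
    },
}

slim = strip_optional_fields(example, keep_linguistic=False)
print(slim["annotation3"])  # {'raw_text': 'Twins', 'toks': 'Twins'}
```

Everything else ("text1", "text2", "text3", "id", "raw_text", "toks", and the "graph" block under "annotation1") is left in place, since the reply says those fields are needed.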