decaNLP icon indicating copy to clipboard operation
decaNLP copied to clipboard

Training on custom dataset and answer is always "o"

Open ashleyyy94 opened this issue 5 years ago • 0 comments

I am trying to train on a custom WOZ dataset. I have 2 jsonl files, train.jsonl and val.jsonl as suggested.

A sample line from my dataset is {"context": "no, i just need to make sure it's cheap. oh, and i need parking", "question": "What is the change in state?", "answer": "name: not mentioned, area: not mentioned, parking: yes, pricerange: cheap, stars: not mentioned, internet: not mentioned, type: hotel;"}

This should be the correct format for the dataset training tuple. The problem is when I run the training process, when it tries to run validation after 1000 iterations (--val_every 1000), the output for the answer is always "o". As shown below:

greedy: 'food: chinese, pricerange: expensive, name: not mentioned, area: centre;'

answer: 'o'

context: 'how about the centre?'

question: 'what is the change in state?'

May I know what I'm doing wrongly? The wrong answer "o" is causing all the metrics like joint goal em, nf1 and nem to be zero.

Thank you.

ashleyyy94 avatar Mar 16 '19 14:03 ashleyyy94