
Train and Test Split

Open · rjtmehta99 opened this issue 6 years ago · 1 comment

I have some questions about the Char RNN classification tutorial:

  1. Why isn't the dataset split into train and test/validation sets to check performance?
  2. How can we add accuracy as a metric to evaluate model performance?

rjtmehta99 · Jan 16 '19 13:01

I agree with this. Currently the author evaluates on the training data, so the confusion matrix he draws is misleading. He trains on 100K names sampled with repetition and then tests on 10K names drawn from the same population, again with repetition. You can't infer accuracy from that. You need to split the data into train and test sets and evaluate on the test set after every epoch. Please see my notebook.

masalha-alaa · Aug 04 '21 15:08
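
For readers following along, here is a minimal sketch of the split-then-evaluate idea described in the comment above. It is not the tutorial's code and not the notebook mentioned in the thread. The helper names `split_unique` and `accuracy`, the toy name list, and the `baseline` predictor are all hypothetical stand-ins. The sketch assumes the data can be represented as `(name, category)` pairs and that the trained model can be wrapped in a function mapping a name to a predicted category.

```python
import random

def split_unique(samples, test_fraction=0.2, seed=0):
    """Deduplicate (name, category) pairs, then split into train and test lists."""
    unique = sorted(set(samples))   # drop repeats so no pair can land in both sets
    rng = random.Random(seed)       # fixed seed keeps the split reproducible
    rng.shuffle(unique)
    n_test = max(1, int(len(unique) * test_fraction))
    return unique[n_test:], unique[:n_test]

def accuracy(predict, samples):
    """Fraction of samples whose predicted category equals the true one."""
    if not samples:
        return 0.0
    correct = sum(1 for name, category in samples if predict(name) == category)
    return correct / len(samples)

# Toy data standing in for the tutorial's name/language pairs (hypothetical).
data = [("Satoshi", "Japanese"), ("Jackson", "English"), ("Satoshi", "Japanese"),
        ("Rossi", "Italian"), ("Müller", "German"), ("Smith", "English")]
train_set, test_set = split_unique(data, test_fraction=0.2)

# Placeholder for a trained model (assumption): always guesses "English".
def baseline(name):
    return "English"

test_acc = accuracy(baseline, test_set)
```

In a real training loop, one would presumably sample training examples only from `train_set` and call `accuracy` on `test_set` after each epoch, with `predict` wrapping the model's forward pass and an argmax over the output scores. Deduplicating before splitting matters here because sampling with repetition from the full population would otherwise let the same name appear on both sides of the split.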