Spell-Checker icon indicating copy to clipboard operation
Spell-Checker copied to clipboard

Good training but bad results

Open iuria21 opened this issue 7 years ago • 5 comments

Hello, first thanks for your work. I trained the model with the same data as you, and I got similar results in the training (except the time): english_training

But then when I try to correct some examples the result is awful... these are the outputs for your same examples: english_examples

Do you know why could it be? The vocabulary is the same, and I don't know why the number of sentences it's not the same but very similar. And, do you think that it will succes if i feed it with more data? I also tried with spanish and the result is not satisfactory...

Thanks!

iuria21 avatar Feb 16 '18 09:02 iuria21

good question.

shibing624 avatar Mar 29 '18 06:03 shibing624

I think that it's not enough noise in the train set. One suggestion is that you increase the threshold one by one for train your model to work with difficult sentences.

HariWu1995 avatar Sep 27 '18 09:09 HariWu1995

try seq2seq with attention, your testing loss is 1.248 is not good enough. i think 0.05 is good for this task. 55873 281/284 [============================>.] - ETA: 6s - loss: 0.0510 55874 282/284 [============================>.] - ETA: 4s - loss: 0.0510 55875 283/284 [============================>.] - ETA: 2s - loss: 0.0510 55876 284/284 [==============================] - 599s 2s/step - loss: 0.0510 - val_loss: 0.2124

shibing624 avatar Sep 28 '18 03:09 shibing624

@shibing624 could you please share the code sample for using seq2seq attention?

SuMyo avatar Feb 26 '20 20:02 SuMyo

https://github.com/shibing624/pycorrector

shibing624 avatar Feb 27 '20 05:02 shibing624