guided_summarization
guided_summarization copied to clipboard
Advice on training the model
Hello,
I started training the model on keyword based guidance (oracle keywords) on a 2x 2080 TI system and trained it for around 36 hours, and tested it. The models seems to be consistently generating gibberish output. (The model trained for about 17 epochs until this stage(
Do you have any advice on how many epochs the model needs to converge? and if this is the same for sentence guided signals as well? Do you have any other advice on model training?
Hi, 5 epochs should be enough. I think you can first debug by using sentence-guided signals.