pointer_summarizer
pointer_summarizer copied to clipboard
Training saturates early?
I'm using the same hypers but seeing this for my training curve. Why would this happen? Looks like the LR is too high but your curve with the same lr seems fine.
data:image/s3,"s3://crabby-images/7433d/7433d3dfe2f8d9c7c5951cbf6b80923e2a5a3fce" alt="Screen Shot 2020-03-25 at 4 27 14 PM"
data:image/s3,"s3://crabby-images/c8efc/c8efcc66ede5540e81ced79f1650d8a301b4b610" alt="Screen Shot 2020-03-25 at 4 27 23 PM"
This seems to be an issue with the python 3 data loading where everything is mapped to OOV
@ajoshi80 were you able to resolve this?
@jivatneet @ajoshi80
The issue is that since Python 3 all tokens are byte variables and therefore they are recognized as OOV. Changing everything back to string representations seems to work.