Results 2 issues of Berthy

Running a pre-trained model on COCO, I'm getting strange predicted captions (examples shown below). Has anyone else run into this issue? ```image 391895: wipe liner undergoing wipe mutt topless wipe...

Thanks for the great implementation. Does this function the exact same way as the Stanford Scene Graph Parser? If not, how is it different from the Stanford parser?

good first issue