NeuralBabyTalk
Why is the generated description unrelated to the detected objects in the demo file?
E.g.: "a [ backpack ] on a [ cheesecake ] on a street". But neither of the two words, [ backpack ] or [ cheesecake ], is detected in the picture.
Same as my problem in #43.