LipReading
LipReading copied to clipboard

Published 20 hours ago •

Reame
Issues

Will unseen model predict for any video content or content only from GRID.txt ?

Open chahatagarwal opened this issue 4 years ago • 0 comments

Why is the format of prediction as well for training defined as command(4) + color(4) + preposition(4) + letter(25) + digit(10) + adverb(4).
Will it work for any video I use to predict with the help of unseen model weights? (As per my understanding, It extracts the lip region using dlib and then try to map visual content to word conversion model?)

Apr 21 '20 10:04 chahatagarwal