Lipreading_using_Temporal_Convolutional_Networks
How to train the model in audiovisual mode
Hi, I saw this question in an open thread but didn't see a response, so I'm opening one so others can find it in the future. First of all, thank you for providing such a detailed README! I'm interested in audiovisual lipreading, but I was wondering how to do that with the existing code, since the documentation only mentions the audio-only and visual-only modes.
I have the same question. Looking forward to a reply.