end-to-end-lipreading

PyTorch code for End-to-End Audiovisual Speech Recognition

Results 11 end-to-end-lipreading issues

Hello there. After creating the files via convert_video.py, I try to run the audio-only main.py and get the following issue. There seems to be something wrong with the dimensions...

Hello. In the audiovisual code there is a concat mode (path to pre-trained concat model). Is this for the pretrained audiovisual model? Also, in the code 2 references of...

Hello, I have some doubts about the process of training the audiovisual model. Currently, I am following the steps indicated in the README, going from temporalconv, backend, and later finetuneGRU...

Hello. I’m doing research on multimodal AI, such as multimodal ASR. How can I get some pretrained models of the audiovisual net?

When I run main.py, I get a FileNotFoundError: No such file or directory 'MONEY/NoisyAudio/-5dB/MONEY_00581.npz'

Hello, thank you for your work. I would like to ask where you got the babble noise that was added to the audio files in your work.

Hello, I scanned the code and the paper, and I can't find the code for adding babble noise at different levels to the audio clips. Could you please tell...

Hi. I would like to know how to add train, validation, and test data after executing the main.py program in the audio-only folder. Please reply.

Hi, thanks for your work. Could you please provide the pretrained model for audio, and also the pretrained model for video? What do I need in order to get them? Just an...

@mpc001 Thank you for your code! I want to run your code, and I found that you wrote the ResNet34 yourself, while PyTorch provides...