end-to-end-lipreading issues

Results 11 end-to-end-lipreading issues

Sort by recently updated

shape '[-1, 29, 512]' is invalid for input of size 497664

Hello there, After creating the files via the convert_video.py I try to run the audio-only main.py and get the following issue. It seems to be something wrong with the dimentions...

jkamb1

Error in Audiovisual

Hello, In the audiovisual code there is a concat mode (path to pre-trained concat model) is this for the pretrained model in audiovisual? Also in the code 2 references of...

jiarouk

training audiovisual net with and without pretrained models

Hello, I have some doubts about the process of training the audiovisual model. Currently, I am following the steps indicated on the README going from temporalconv, backend, and later finetuneGRU...

msanchez-fi

How can we get pretrained models?

Hello. I’m doing my research on multimodal AI, such as multimodal ASR. How can I get some pretrained models of audiovisual net?

yskimno1

How can I get the "NoisyAudio/-5dB/MONEY_00581.npz"?

When I run the main.py, I get the FileNotFoundError: No such file or directory 'MONEY/NoisyAudio/-5dB/MONEY_00581.npz'

flyyyyer

About noisy audio files

Hello, thank you for your work. I would like to ask where did you get the babble noise added in the audio files in your work?

gopzgopez

about training the audio-only model

Hello, I scan the code and the paper, and I can't find the code about adding the babble noise of different levels to the audio clip. Could you please tell...

kisstherainfh

How to add train data in audio only main prgrm?

Hi. I would like to know how to add train, validation and test data after executing main.py prgrm in audio only folder?. Plz reply.

aloknagral

pretrained models

Hi, thanks for your work. Please can you provide the pretrained model for audio. Also the pretrained model for video. What I need in order to get them? Just an...

mariela-dev

About the ResNet !

@mpc001 Thank you for your code ! I want to run your code, and I found that in your code , you write the ResNet34 yourself while the Pytorch provide...

CXiaoDing

end-to-end-lipreading
end-to-end-lipreading copied to clipboard

Metadata

shape '[-1, 29, 512]' is invalid for input of size 497664

Error in Audiovisual

training audiovisual net with and without pretrained models

How can we get pretrained models?

How can I get the "NoisyAudio/-5dB/MONEY_00581.npz"?

About noisy audio files

about training the audio-only model

How to add train data in audio only main prgrm?

pretrained models

About the ResNet !

← Metadata

Owner

Metadata

end-to-end-lipreading end-to-end-lipreading copied to clipboard

Metadata

← Metadata

Owner

Metadata

end-to-end-lipreading
end-to-end-lipreading copied to clipboard