Pingchuan Ma comments

Results: 14 comments by Pingchuan Ma

Which methods do you use to extract landmarks from images?

Hi @Winfredy, we use [RetinaFace](https://openaccess.thecvf.com/content_CVPR_2020/papers/Deng_RetinaFace_Single-Shot_Multi-Level_Face_Localisation_in_the_Wild_CVPR_2020_paper.pdf) as the face detector and a 2D facial landmark detector based on [FAN](http://openaccess.thecvf.com/content_ICCV_2017/papers/Bulat_How_Far_Are_ICCV_2017_paper.pdf).

How to add training data in the audio-only main program?

Hi, we use noise augmentation, which is applied only during training. The model can then be evaluated under different noise conditions. As for how the noise is added, it’s similar to [this](https://github.com/mpc001/Lipreading_using_Temporal_Convolutional_Networks/blob/fdc0f1a6d5da93246f9aa6e339bcd3f0d9703c0a/lipreading/preprocess.py#L150).

Can't process

A similar issue was raised in #16. Could you please check that you have specified the file path correctly for the Windows operating system?

How can I get the "NoisyAudio/-5dB/MONEY_00581.npz"?

Duplicate of #14

How can I compute the landmarks using my own dataset?

Hi, the landmarks can be extracted with a face tracker. You could check dlib (#1) or the face alignment library (#10). We also provide a RetinaFace tracker, but it's in [another repository](https://github.com/mpc001/Visual_Speech_Recognition_for_Multiple_Languages#how-to-prepare-models-and-landmarks).

about training the audio-only model

Hi, please check the attached babble noise file: [babble noise](https://drive.google.com/file/d/1uNo0PaGI-jyNTM-zR_OrNWrppaM-N6SS/view?usp=sharing). You could add a random clip of the noise to the audio and get noisy audio with a specified SNR...
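For readers who want to try this, here is a minimal sketch of mixing a random noise clip into a waveform at a target SNR. It is not the repository's implementation; the function name `add_noise`, its parameters, and the choice to tile a noise file that is shorter than the speech are all assumptions made for illustration. It assumes both inputs are 1-D arrays at the same sample rate.

```python
import numpy as np


def add_noise(signal, noise, snr_db, rng=None):
    """Mix a random clip of `noise` into `signal` at roughly `snr_db` dB.

    Hypothetical helper for illustration only: both inputs are assumed to be
    1-D waveforms sampled at the same rate.
    """
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=np.float32)
    noise = np.asarray(noise, dtype=np.float32)

    # Assumption: if the noise is shorter than the signal, repeat it.
    if len(noise) < len(signal):
        reps = int(np.ceil(len(signal) / len(noise)))
        noise = np.tile(noise, reps)

    # Pick a random clip of the noise with the same length as the signal.
    start = rng.integers(0, len(noise) - len(signal) + 1)
    clip = noise[start:start + len(signal)]

    signal_power = np.mean(signal ** 2)
    noise_power = np.mean(clip ** 2)
    if noise_power == 0:
        # A silent clip cannot be scaled to any SNR; return the clean signal.
        return signal.copy()

    # SNR_dB = 10 * log10(P_signal / P_noise), so scale the clip such that
    # its power becomes P_signal / 10^(SNR_dB / 10).
    scale = np.sqrt(signal_power / (noise_power * 10 ** (snr_db / 10)))
    return signal + scale * clip
```

Calling `add_noise(speech, babble, snr_db=-5.0)` would, under these assumptions, return a waveform of the same length as `speech` with the babble mixed in at about -5 dB.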

about training the audio-only model

Duplicate of #23

I want to cite your paper, but should I report the accuracy as 82 or 83.3?

Hi, thanks for your interest! It's better to use 83.3, which is the updated result. Kind regards, Pingchuan

Error in extract_audio_from_video

Hi, could you check if you've installed `ffmpeg`?
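One quick way to run that check from Python is sketched below. The helper names `find_executable` and `first_version_line` are hypothetical; the sketch only tests whether an `ffmpeg` binary is reachable on `PATH` and does not reproduce how the repository invokes it.

```python
import shutil
import subprocess


def find_executable(name="ffmpeg"):
    """Return the full path of `name` if it is on PATH, otherwise None."""
    return shutil.which(name)


def first_version_line(path):
    """Run `<path> -version` and return the first line of its output."""
    result = subprocess.run(
        [path, "-version"], capture_output=True, text=True, check=False
    )
    lines = result.stdout.splitlines()
    return lines[0] if lines else ""
```

If `find_executable()` returns `None`, `ffmpeg` is either not installed or not on `PATH`; otherwise `first_version_line(find_executable())` shows which build would be picked up.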

Acc of resnet18_dctcn_video_boundary in my test is wrong

Hello, that's a weird result on the test set. When testing a model without word boundaries, have you encountered a similar issue? By the way, when testing a model that...