Visual_Speech_Recognition_for_Multiple_Languages icon indicating copy to clipboard operation
Visual_Speech_Recognition_for_Multiple_Languages copied to clipboard

Visual Speech Recognition for Multiple Languages

Results 16 Visual_Speech_Recognition_for_Multiple_Languages issues
Sort by recently updated
recently updated
newest added

Which methods you use to extract landmarks from image?

Thanks for the releasement. I wonder if the training code will be available in the future? Thanks.

Could you please share the original GRID data set? There are some missing items online.

Do you still have a copy of the CMU-MOSEAS dataset? I've been informed by the authors that they lost all copies of it. If you still have a copy I...

Can you please tell me the life version of pytorch you are using, I have some errors with the 2.0 version. Thank you!

Hi dear I would ask how I can deal with data already cropped mouth region with distribution size, I want to apply all pre-processing and data augmentation processes on this...

As mentioned in S3, the pre-trained models are always trained on the same data as the full model (yet I do not know the pre-training details), and specially the pre-trained...

Hi, thanks for this great work. I have a question about the section `"3.8 Using Additional Training Data"` from your paper `"Visual Speech Recognition for Multiple Languages in the Wild"`...

Thanks for releasing the awesome work! I noticed that the Chinese lip reading model is based on the visual modality. I used the visual model but it achieved poor performance...