End-to-End-VAD
End-to-End-VAD copied to clipboard
an Audio-Visual Voice Activity Detection using Deep Learning
Hello, thanks for your excellent work! I am trying to fit your model on my own dataset. As I didn't found any labels in the dataset you provide in the...
Hi there, I would lke to use your dataset for a small project, while I have a small question. I am wondering why the audio parameters set here are different...
Hi, When I train the video modality, I can not get a good result, there may some errors be in the prepared the video labels?
Hi, thanks for your excellent work. When I rerun this code, I get some problem: the training loss is descending and the ACC is improving, but the test ACC is...
Hello Team, Thanks for providing the repo. I have replicated this repo step by step as per the details mentioned in the paper and in this repo. First I have...