avobjects

Implementation for the ECCV 2020 paper "Self-Supervised Learning of Audio-Visual Objects from Video"

9 avobjects issues

Hi, I want to ask about the data for training. In the paper, LRS2 and LRS3 are used for training and validation, so I downloaded the LRS2 and LRS3 datasets. These datasets consist of videos...

Dear authors, thanks for sharing this work; the idea looks really novel and effective! But when I try to test it on my own video, such as several people...

We are passing videos of different durations through the model and we are getting audio embeddings of different sizes. Is this normal? We are looking forward to using the...
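As a rough illustration of why embedding size can track clip duration, here is a minimal sketch assuming a generic convolutional audio encoder with no temporal pooling at the end. The feature rate (`feats_per_s`) and the layer configuration are hypothetical values chosen for illustration; they are not taken from the avobjects code. Under those assumptions, the number of temporal embedding steps grows roughly in proportion to the input length.

```python
def conv_output_length(n_in, kernel, stride, padding=0):
    """Output length of a 1D convolution (standard formula, dilation 1)."""
    return (n_in + 2 * padding - kernel) // stride + 1


def embedding_frames(duration_s, feats_per_s=100, layers=((3, 2, 1), (3, 2, 1))):
    """Temporal steps produced for a clip of the given duration.

    feats_per_s and layers (kernel, stride, padding) are hypothetical
    values for illustration, not the avobjects architecture.
    """
    n = int(round(duration_s * feats_per_s))
    for kernel, stride, padding in layers:
        n = conv_output_length(n, kernel, stride, padding)
    return n


if __name__ == "__main__":
    # Longer clips yield proportionally more embedding steps.
    for seconds in (2, 4, 8):
        print(seconds, embedding_frames(seconds))
```

If a fixed-size embedding is needed downstream, one common option is to pool over the time axis or to crop or pad clips to a fixed duration before inference; whether that suits this model is not stated in the issue.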

Thanks for the great work. I want to train this model on my custom data. Is there any plan to release the training code? Thanks.

Thank you for sharing such awesome work! But how can I reproduce the [demo of Active Speaker Detection](https://www.youtube.com/watch?v=A5tmRjpxHvA&feature=emb_logo) shown on the [project page](https://www.robots.ox.ac.uk/~vgg/research/avobjects/)? I noticed the function [viz_boxes_with_scores](https://github.com/afourast/avobjects/blob/4a9d0d5af373d682be29487e68b9233809552e08/viz_utils.py#L121) but I didn't...

Hi, thanks for sharing your great work. Could I ask you two questions? 1. I have just tried to run your code, but the visualization looks like there...

python main.py --resume checkpoints/avobjects_loc_sep.pt --input_video demo.mp4 --output_dir demo_out
Using device: cuda
rm demo_out/* -rf
Checkpoint checkpoints/avobjects_loc_sep.pt loaded!
Resampling media/demo.mp4 to 25 fps
/opt/anaconda3/envs/avobjects/lib/python3.6/site-packages/torchvision/transforms/functional.py:405: UserWarning: Argument interpolation should be of type...