avobjects issues

What data used in training?

1

Hi, i want to ask about data for training. In, paper, LRS2, LRS3 are used for training and validation. So, I download LRS2, LRS3 dataset. These datasets consist of videos...

yw0nam

Performance bad on own video?

1

Dear authors: Thanks for sharing this fancy idea work, it looks really novel and effective! But when i try to test it on my own video, such as several people...

dragen1860

Different embedding dimensions

We are passing videos of different durations through the model and we are getting audio embedding of different sizes. Is this normal ? We are looking forward to use the...

Shreyank21

How to train the model, and get the "avobjects_los_sep.pt" ？

Overcautious

Is there any plan to distribute training code for custom data?

Thanks for great work. I want to train this model for my custom data. Is there any plan to distribute training code? Thanks.

yw0nam

how to achieve the demo of Active Speaker Detection

Thank you for sharing such awesome work! But how to achieve the [demo of Active Speaker Detection](https://www.youtube.com/watch?v=A5tmRjpxHvA&feature=emb_logo) as the [Project page](https://www.robots.ox.ac.uk/~vgg/research/avobjects/) shows? I noticed the function [viz_boxes_with_scores](https://github.com/afourast/avobjects/blob/4a9d0d5af373d682be29487e68b9233809552e08/viz_utils.py#L121) but I didn't...

KiAlexander

The visualization looks like there is bug in code

1

Hi, thanks for share your great work. Could I have 2 quetions for you? 1 > I have just try to run your code, but the visualization looks like there...

vuthede

Excuse me, error when running "python main.py --resume checkpoints/avobjects_loc_sep.pt --input_video demo.mp4 --output_dir demo_out"

1

python main.py --resume checkpoints/avobjects_loc_sep.pt --input_video demo.mp4 --output_dir demo_out Using device: cuda rm demo_out/* -rf Checkpoint checkpoints/avobjects_loc_sep.pt loaded! Resampling media/demo.mp4 to 25 fps /opt/anaconda3/envs/avobjects/lib/python3.6/site-packages/torchvision/transforms/functional.py:405: UserWarning: Argument interpolation should be of type...

zhang123-sys

avobjects
avobjects copied to clipboard

Metadata

What data used in training?

Performance bad on own video?

how to train your model?

Different embedding dimensions

How to train the model, and get the "avobjects_los_sep.pt" ？

Is there any plan to distribute training code for custom data?

how to achieve the demo of Active Speaker Detection

The visualization looks like there is bug in code

Excuse me, error when running "python main.py --resume checkpoints/avobjects_loc_sep.pt --input_video demo.mp4 --output_dir demo_out"

← Metadata

Owner

Metadata

avobjects avobjects copied to clipboard

Metadata

← Metadata

Owner

Metadata

avobjects
avobjects copied to clipboard