Vladimir Iashin

Results 7 repositories owned by Vladimir Iashin

BMT

223
Stars
56
Forks
Watchers

Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

CS231n

50
Stars
9
Forks
Watchers

PyTorch/Tensorflow solutions for Stanford's CS231n: "CNNs for Visual Recognition"

MDVC

138
Stars
19
Forks
Watchers

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

SpecVQGAN

323
Stars
36
Forks
Watchers

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

video_features

504
Stars
95
Forks
Watchers

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

WebsiteYOLO

41
Stars
64
Forks
Watchers

The back-end for the YOLOv3 object detector running as a webapp

SparseSync

45
Stars
8
Forks
Watchers

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)