datasets icon indicating copy to clipboard operation
datasets copied to clipboard

Feature: BOBSL dataset integration

Open cleong110 opened this issue 1 year ago • 1 comments

Complete work on https://github.com/sign-language-processing/datasets/pull/5

https://www.robots.ox.ac.uk/~vgg/data/bobsl/ is the dataset website

cleong110 avatar Mar 20 '24 20:03 cleong110

I will probably take on this as I am working more on BOBSL.

The first step for me is to construct a BOBSL_ISLR dataset consisting of ~5M huge but noisy sign-level training examples.

I will then look at the subtitle/sentence-level based on #5 if I work on MT. Apparently, VGG has their own pipeline, a bit messy, will see how to integrate everything efficiently.

J22Melody avatar Dec 18 '24 11:12 J22Melody