LDong comments

Results 7 comments of


                                            LDong

How to divide the dataset into train/val/test?

Thank you for your reply! > There are no predefined train/test splits, but the dataset is split into 5 folds of equal size. A common procedure is to use 4...

Unknown definition of "qid" and "type" in annotations

> Hi, thanks for the interest. `qid' gives the question id within a video. So video_qid identifies a sample of the whole dataset. There is no binary question in the...

Unknown definition of "qid" and "type" in annotations

> Yes. Thank you. And there are 8 types of questions, ['DC', 'CW', 'CH', 'TN', 'TC', 'DL', 'DO', 'TP'], what do they mean respectively?

Memory file -> numpy -> memory file

> Thanks for good library for us. As mentioned in title I want to process video in raw byte to numpy then encode it into raw byte video. I succeed...

What data used in training?

are the LRS2 and LRS3 datasets pre-training dataset? I can not found any other description about the pre-training dataset.

Why is my count for the number of videos in htm_aa_v1.csv 247, 564 instead of 370K?

> htm_aa_v1.csv should be the output of TAN, which will be used for downstream representation learning. It is not the HTM370k training set for TAN. Will you release a list...

vid2seq installation error: No matching distribution found for t5x

> It seems the t5x installation fails. You may try to follow the t5x installation instructions from https://github.com/google-research/t5x. The [t5x gpu](https://github.com/google-research/t5x/blob/main/t5x/contrib/gpu/scripts_gpu/README.md) is so hard to use. After **pip install -r...