av_hubert icon indicating copy to clipboard operation
av_hubert copied to clipboard

How to get the ${nshard} value and ${rank} value? Are they random numbers?

Open SE-Nickjackson opened this issue 2 years ago • 3 comments

Hi authors, thank you for the great work! When I preprocess the dataset, I don't know what values ${nshard} and ${rank} should take. Can you help me out?

SE-Nickjackson avatar Sep 03 '22 15:09 SE-Nickjackson

Hi,

${nshard} splits the whole data into ${nshard} parts and ${rank} is the job id processing the ${rank}th part. For example.,python lrs3_prepare.py --lrs3 ${lrs3} --ffmpeg /path/to/ffmpeg --rank ${rank} --nshard ${nshard} --step ${step} is the job for processing the ${rank}-th shard of the LRS3 dataset. To process the whole dataset, you need to launch ${nshard} jobs with ${rank} varying from 0 to ${nshard}-1.

chevalierNoir avatar Sep 03 '22 22:09 chevalierNoir

Ok, thanks

SE-Nickjackson avatar Sep 04 '22 01:09 SE-Nickjackson

Hi after running

python lrs3_prepare.py --lrs3 lrs3/ --ffmpeg /path/to/ffmpeg --rank 10 --nshard 20 --step 1 I am getting

if word_intervals[-1][-1] < max_duration: IndexError: list index out of range

Can you please help it out

shakeel608 avatar Feb 10 '23 20:02 shakeel608