Shruti Mittal
Shruti Mittal
Hey, sorry was travelling last 2 days. Attaching cprofile on google colab. This is after setting `cache_on_load = True` and `preload_wav=True` in dataset.py file; `num_workers=1` data:image/s3,"s3://crabby-images/4f365/4f365171e8f77db607998e278419f9960a45dbc1" alt="Screenshot from 2020-03-13 14-26-46" l...
Hi @pswietojanski Cpu usage looks ~100%. Any comments here? this is htop output on GCP - for 1st epoch, using `num_workers=16`; `P100`; `cache_on_load=True`; `preload_wav=True` data:image/s3,"s3://crabby-images/4829f/4829fec4891c99becdec3f0964494a32ebf8d647" alt="Screenshot from 2020-03-13 16-45-27" this is...
I did `os.environ["OMP_NUM_THREADS"] = "1"`, `num_workers = 4` doesn't reduce the time much (20-30min max)
Hey i am getting better speed now, was using lower no. of CPU cores and `K80` machine earlier. with `num_workers = 8` and `V100` time to train 1 epoch is...
with `num_workers = 16` `P100` setting `os.environ["OMP_NUM_THREADS"] = "1"` the epoch trains in ~90mins `preload_wav = True` `cache_on_load = True` however preloading is not improving speed for epoch 2 onwards...
why is caching not increasing the speed? - setting `preload_wav = True` and `cache_on_load=True` in train.py. Could cpu to gpu data transfer time be a bottleneck?
Hey, sorry this was long back. Dont remember the details now. I pretty much followed the ReadMe and the training scripts to understand the data pre-processing pipeline.
Hey did you segment your data? I think i got a similar error when I didnt
No, check the script at `/data/prep/prepare_segmented_dataset_libri.py`