Piotr Żelasko comments

Results 523 comments of


                                            Piotr Żelasko

Tune lilcom compression settings

Also some examples of how the batches look like in our custom setup can be seen here.. https://github.com/lhotse-speech/lhotse/pull/234

Tune lilcom compression settings

Thanks @ejguan! This looks pretty cool! I think I understand the whole flow. The GroupBy operator and matching by batch index is a cool idea -- but how does GroupBy...

Reverb in frequency/STFT domain

Thanks! It’s good to know. I think time-domain is fine, it seems on-the-fly audio reading and feature extraction is efficient enough in the setups we tried so far.

error when proceeding the wenetspeech data following k2's prepare.sh

You might want to try decreasing num_jobs and max_duration in that script. I don't know why this sometimes happens with `compute_and_store_features_batch` but I've seen that before.

Stateful augmentation (e.g. using a schedule)

... there's also the question of storing/restoring the state to resume training from a checkpoint.

duration param in extract_from_recording_and_store

It looks like a bug, what you're suggesting seems correct. Do you mind making a PR with your fix and a unit test to check that it's doing the right...

Issues about feature extraction and lilcom: last stride should be 1

Thanks for letting us know! Actually I think I can add a check before lilcom compression so that when it detects only (or mostly) non-negative values/narrow value range, it will...

SSL: CERTIFICATE_VERIFY_FAILED

Hmmm I'm not sure we want to do that by default. We could have a flag to turn off SSL for this recipe, e.g. using this solution https://stackoverflow.com/questions/33770129/how-do-i-disable-the-ssl-check-in-python-3-x Would you...

SSL: CERTIFICATE_VERIFY_FAILED

@oshindow can you try the solution above?

Loading sampler state dict error

Hmm we had a similar issue here, but it's already fixed https://github.com/lhotse-speech/lhotse/issues/753 Can you print the contents of the sampler's `state_dict` that's saved on disk? It would also be helpful...