bofeng huang comments

Results: 33 comments of bofeng huang

DynamicBatchSampler

Hi @anautsch, thank you very much for your explanations! And yeah, my question was exactly why an arbitrary lognorm is used rather than one with parameters fitted on the dataset....
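To make "parameters fitted on the dataset" concrete, here is a minimal sketch of a maximum-likelihood log-normal fit on utterance durations. It is not the sampler's actual code: the helper `fit_lognorm` and the example durations are hypothetical, and it only relies on the standard result that the MLE of a log-normal is the mean and (population) standard deviation of the log-values.

```python
import math
import statistics


def fit_lognorm(lengths):
    """Hypothetical helper: MLE fit of a log-normal to positive lengths.

    Returns (mu, sigma), the mean and population standard deviation
    of log(length).
    """
    logs = [math.log(x) for x in lengths]
    mu = statistics.fmean(logs)
    sigma = statistics.pstdev(logs)  # MLE uses the population std, not the sample std
    return mu, sigma


# Made-up durations in seconds, chosen so that the logs are 0, 1 and 2
durations = [1.0, math.e, math.e ** 2]
mu, sigma = fit_lognorm(durations)
print(f"mu={mu:.3f}, sigma={sigma:.3f}")
```

The fitted `mu` and `sigma` could then replace hand-picked values when deriving bucket boundaries from the distribution.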

DynamicBatchSampler

@anautsch nice! Looking forward to our discussion :)

DynamicBatchSampler

Hi @anautsch,

As discussed last week, I came back with the tests on more datasets:

- MiniLibriSpeech
- LibriSpeech
- CommonVoice IT
- CommonVoice FR

I used `batch_size=32`...

@anautsch thanks for the rewrite. I didn't realize that the dynamic sampler with the beta distribution has the shorter initialization time (except on LibriSpeech). Just when fitting the beta distribution on datasets,...

DynamicBatchSampler

Yeah, it was the same "total time" used in the tutorial.

> Interesting that fewest padding is not always fewest batches!

I think this is related to datasets, #buckets and...

New DynamicBatchSampler

There are also some other changes including:

- Change `_ex_lengths` from dict to np array
- Isolate the function `_get_bucket_ids` which distributes the examples into buckets by bucket boundaries, to...
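As an illustration of distributing examples into buckets by bucket boundaries, here is a minimal sketch using `np.searchsorted`. It is an assumption about how such a function could look, not the PR's `_get_bucket_ids`; the name `get_bucket_ids`, the boundary convention, and the sample values are all hypothetical.

```python
import numpy as np


def get_bucket_ids(ex_lengths, bucket_boundaries):
    """Hypothetical sketch: map each example length to a bucket index.

    bucket_boundaries must be sorted in ascending order. Bucket i holds
    lengths in (boundaries[i-1], boundaries[i]]; lengths above the last
    boundary fall into an extra final bucket.
    """
    return np.searchsorted(bucket_boundaries, ex_lengths, side="left")


# Made-up example lengths (seconds) stored as an np array rather than a dict
ex_lengths = np.array([1.2, 3.5, 7.0, 12.4])
bucket_boundaries = np.array([2.0, 5.0, 10.0])

bucket_ids = get_bucket_ids(ex_lengths, bucket_boundaries)
print(bucket_ids)
```

Keeping the lengths in an array lets the whole assignment run as one vectorized call instead of a Python loop over a dict.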

New DynamicBatchSampler

Hi @anautsch, I've run some real tests on the LibriSpeech-ASR-CTC-wav2vec recipe with different setups on a single RTX 8000. `num_workers=1` is added to the default recipe to get a little speed...

New DynamicBatchSampler

So I modified it to fill up the batches by `batch_size` in order to get the same number of batches, and ended up with the following results.

| | test clean...

Update prep_mtedx_data.py

Hi again @kahne, `soundfile.write` accepts an array of shape `(T x C)`, but `_wavform` here has shape `(C x T)`. Fixed in the 2nd commit.
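A minimal sketch of the layout fix, assuming a NumPy waveform in channels-first `(C x T)` layout. The helper `to_time_major` and the dummy signal are hypothetical and not taken from the commit; the `soundfile.write` call is left as a comment so the snippet runs without touching the file system.

```python
import numpy as np


def to_time_major(waveform):
    """Hypothetical helper: convert (C x T) audio to (T x C).

    soundfile.write expects frames along the first axis and channels
    along the second, so a channels-first array has to be transposed.
    """
    return np.ascontiguousarray(waveform.T)


# Dummy stereo signal: 2 channels, 16000 samples
waveform = np.zeros((2, 16000), dtype=np.float32)
frames = to_time_major(waveform)
print(frames.shape)

# With the soundfile package installed, the write would then look like:
# import soundfile as sf
# sf.write("out.wav", frames, 16000)
```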

Update prep_mtedx_data.py

Hi, the `is_audio` arg has been added to `get_zip_manifest` for audio/fbank in the 3rd commit.