Piotr Żelasko comments

Results: 523 comments by Piotr Żelasko

SpecAugment `state_dict` not compatible with PyTorch's

Would you be willing to contribute the fix? You'd need to modify the method here: https://github.com/lhotse-speech/lhotse/blob/e982003c1777a8f48ee40580bd835c931a03062d/lhotse/dataset/signal_transforms.py#L262

Move window tensor to proper device

You can move `Wav2Win` (or any other module in that file) to the target device before running inference. If you're using `Fbank`, there's an option in the config to place...

Move window tensor to proper device

Wouldn't `module = module.to("cuda")` solve the issue by placing every registered parameter and buffer on the GPU?

Problem with CutSet.from_manifests

I don't think `decompose` was ever tested in this way, although I would have expected it to work. I'm afraid I don't have enough time right now to look into...

Problem with CutSet.from_manifests

Thanks, you're right. I'll keep the issue open for now.

Problem with CutSet.from_manifests

`Features` does have a `recording_id` field. If you can provide a way to reproduce this with a small dataset such as yesno or mini Librispeech, I can look into it.

Is there any sampler by using batch size, not duration?

You can use `DynamicCutSampler` and specify `max_cuts` rather than `max_duration`.
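To illustrate how a count limit differs from a duration limit, here is a toy batcher in plain Python. It is only a sketch of the idea, not lhotse's implementation: the function `batch_cuts` and the sample durations are hypothetical, and only the parameter names `max_cuts` and `max_duration` come from the comment above.

```python
from typing import Iterable, Iterator, List, Optional


def batch_cuts(
    durations: Iterable[float],
    max_duration: Optional[float] = None,
    max_cuts: Optional[int] = None,
) -> Iterator[List[float]]:
    """Group cut durations (in seconds) into batches.

    Hypothetical helper: a batch is closed as soon as adding the next cut
    would exceed either of the limits that were provided.
    """
    if max_duration is None and max_cuts is None:
        raise ValueError("Provide max_duration, max_cuts, or both.")
    batch: List[float] = []
    total = 0.0
    for d in durations:
        too_many = max_cuts is not None and len(batch) + 1 > max_cuts
        too_long = max_duration is not None and total + d > max_duration
        # Never emit an empty batch: an oversized cut forms a batch on its own.
        if batch and (too_many or too_long):
            yield batch
            batch, total = [], 0.0
        batch.append(d)
        total += d
    if batch:
        yield batch


durations = [2.0, 3.0, 10.0, 1.0, 4.0]  # made-up cut lengths in seconds
by_count = list(batch_cuts(durations, max_cuts=2))
by_duration = list(batch_cuts(durations, max_duration=10.0))
print(by_count)
print(by_duration)
```

With a count limit every batch has the same number of items, but the total amount of audio per batch varies with the cut lengths; with a duration limit it is the other way around.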

Default encoding change

That's surprising; I don't think there's any code in lhotse itself that would change the encoding. If the env var works for you, that sounds good.

FileNotFoundError: [Errno 2] Unable to synchronously open file

Update the paths to the h5 files in your manifests.
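One way to do this without re-extracting features is to rewrite the path prefix stored in each manifest entry. The sketch below uses only the standard library and works on manifest lines that have already been read as JSON strings. It assumes the path is stored under a `storage_path` key, possibly nested inside other fields; the example entry and both prefixes are hypothetical, so check the key name against your own manifests first.

```python
import json
from typing import Any, Iterable, List


def replace_storage_prefix(node: Any, old_prefix: str, new_prefix: str) -> Any:
    """Return a copy of `node` with the prefix of every `storage_path` swapped.

    Assumption: the file location lives under a key named `storage_path`.
    """
    if isinstance(node, dict):
        out = {}
        for key, value in node.items():
            if (
                key == "storage_path"
                and isinstance(value, str)
                and value.startswith(old_prefix)
            ):
                out[key] = new_prefix + value[len(old_prefix):]
            else:
                out[key] = replace_storage_prefix(value, old_prefix, new_prefix)
        return out
    if isinstance(node, list):
        return [replace_storage_prefix(v, old_prefix, new_prefix) for v in node]
    return node


def rewrite_manifest_lines(
    lines: Iterable[str], old_prefix: str, new_prefix: str
) -> List[str]:
    """Apply the prefix swap to each non-empty JSON line of a manifest."""
    return [
        json.dumps(replace_storage_prefix(json.loads(line), old_prefix, new_prefix))
        for line in lines
        if line.strip()
    ]


# Hypothetical single-entry manifest with a stale location.
example = [
    '{"id": "cut-1", "features": {"storage_path": "/old/feats/a.h5", "storage_key": "cut-1"}}'
]
fixed = rewrite_manifest_lines(example, "/old/feats", "/new/feats")
print(fixed[0])
```

Write the result to a new manifest file rather than overwriting the original, so the old manifest remains available if the new prefix turns out to be wrong.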

FileNotFoundError: [Errno 2] Unable to synchronously open file

Yes