Cesc comments

Results 32 comments of


                                            Cesc

what's the intention of mocha first layers?

I test my 1000h-data streaming model, the perfermance is bad because lower decoder layers make very confused alignments, now I get the idea of so called "attention heads pruning in...

what's the intention of mocha first layers?

definitively

implementation of `safe_cumprod`

@bo-son I think you're right

RuntimeError exporting stateless5 model using jit

> Also, would you mind creating a PR to fix the outdated code? Sure thing, I'll do it later.

on-the-fly fbank feats

Thanks for the reply, so I could just skip making fbank stages in prepare.sh?

on-the-fly fbank feats

Thanks mate.

on-the-fly fbank feats

@csukuangfj I found on-the-fly feats computation makes training much slower, for example it cost 20 seconds using pre computed kaldi fbank feats for 50 batch iteration and it took about...

on-the-fly fbank feats

> Are you using raw waves? Also, is your disk fast? Yes I'm using raw waves and how to check my disk is fast or slow?

on-the-fly fbank feats

BTW, I've trained using raw waves with Espnet, the gpu utility is around 70% which I think is normal , the difference is in Espnet I implement Fbank as a...

on-the-fly fbank feats

> Can you try increasing the number of dataloader workers? Perhaps that’s the bottleneck. > > If you want to use fbank as a layer you can modify the code...