Sung-Lin Yeh comments

Results 18 comments of


Sung-Lin Yeh

Insertion problem when decoding with pre-trained ASR model.

After carefully applying masking to the fbank and normalization, I found the encoder outputs vary with batch size. As @danpovey mentioned, the padding for cov layers can cause such difference....

Insertion problem when decoding with pre-trained ASR model.

> > @30stomercury could you take a look at that? I don't understand why the results vary with batch size here ... > > I think several reasons can cause...

Insertion problem when decoding with pre-trained ASR model.

Hi @GaetanLepage , I've solved it in #933. Please see the results I got below. It would be great if you can help us test the numbers as well. `batch_size=8`...

Insertion problem when decoding with pre-trained ASR model.

Great, @GaetanLepage thanks for the testing.

SpeechBrain 0.6.0

Thanks @TParcollet , I'm investigating kenlm and see if we can have BPE in ARPA format. I will add some to-dos later

SpeechBrain 0.6.0

Hi @mravanelli and @TParcollet, this is the current status of this pr. - [x] Support NgramScorer with kenlm. - [x] full scorer mode - [x] partial scorer mode - [x]...

SpeechBrain 0.6.0

@Gastron Sure, we can do it here. I will add it to the to-dos.

SpeechBrain 0.6.0

Hi @mravanelli , thank you for your suggestions. 1, 2. Currently I don't have a better way to manage importing part, I will finish the docstring first. 3. Yes, that...

Hi @mravanelli , I propose two ways to manage scorers in yaml. The second approach is similar to the way you suggested. ``` # Method1 scorer: !new:speechbrain.decoders.scorer.ScorerBuilder eos_index: !ref blank_index:...