A.HEBA comments

Results 11 comments of


                                            A.HEBA

Add WFST decoding based on k2 for speechbrain

@luomingshuang ; We start working on your PR, thank you very much for the integration, Just to understand, why you still call the graph HLG ? and not TLG ?,...

Add WFST decoding based on k2 for speechbrain

@luomingshuang , I just run the full exp, and saw: ``` Test-clean: %WER 2005.99 [ 52557 / 2620, 49941 ins, 0 del, 2616 sub ] %SER 100.00 [ 2620 /...

Add WFST decoding based on k2 for speechbrain

> @luomingshuang , I just run the full exp, and saw: > > ``` > Test-clean: > %WER 2005.99 [ 52557 / 2620, 49941 ins, 0 del, 2616 sub ]...

Add WFST decoding based on k2 for speechbrain

> > @luomingshuang , I just run the full exp, and saw: > > ``` > > Test-clean: > > %WER 2005.99 [ 52557 / 2620, 49941 ins, 0 del,...

Add WFST decoding based on k2 for speechbrain

thanks @luomingshuang , I'm able to have same results with `torch=1.8`. After adding our `CategoricalLabel_encoding`, I will move to 1.10.. Let me highlight a small problem where I think we...

Add WFST decoding based on k2 for speechbrain

Hello @csukuangfj , @luomingshuang , Thanks you for the tricks. from my side, it seems not working; Two errors: 1st one if I consider @csukuangfj tricks: ``` Removing disambiguation symbols...

Add WFST decoding based on k2 for speechbrain

> > So the HLG compile, but in the decoding phase, I have an error when using the HLG.fst > > > It seems that the intersect between H_L_G is...

Add WFST decoding based on k2 for speechbrain

Let me start pushing the final updates for handling `CategoricalEncoder`, @csukuangfj, @luomingshuang , is there a comparaison between you `ctc_topo` and the one proposed by Miao in EESEN?

add CTC recipe GigaSpeech

let add: - [ ] Logging duration, num segments... - [ ] Evaluate adding shards & webdatasets - [ ] CTC recipe

There is a problem about training a RNN-T model?

Hello, We have a similar implementation within speechbrain (implemented with Python-Numba), you can take a look https://github.com/speechbrain/speechbrain/tree/develop/recipes/LibriSpeech/ASR/transducer transducer implementation: https://github.com/speechbrain/speechbrain/blob/develop/speechbrain/nnet/loss/transducer_loss.py otherwise, you have the torchaudio supporting this lib within torchaudio...