Yun Wang (Maigo) comments

Results: 34 comments of Yun Wang (Maigo)

What's the place CNN exactly?

Search for "Pretrained Models" on the demo website (https://projects.csail.mit.edu/soundnet/).

Reproduce probabilities and backoffs from python

I get different results from both you and KenLM, but I believe KenLM is making a mistake here. What I got with KenLM on your corpus: ``` -0.6726411 went -0.022929981...

Reproduce probabilities and backoffs from python

There are two other causes for the discrepancy: 1. KenLM does not include `` when calculating the vocabulary size, while your program does. I think KenLM's approach makes more sense...

Wrong calculation of 1-gram adjusted counts?

Another example on which KenLM miscalculates `s.n[1]` and `s.n[2]` can be found in #405. In that example, this affects the discounts, and the probabilities in the final LM.
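
To illustrate why a miscount in `s.n[1]` or `s.n[2]` propagates into the discounts, here is a minimal sketch of the closed-form discount estimate for modified Kneser-Ney smoothing (Chen and Goodman). It assumes that `s.n[k]` denotes the number of n-grams of a given order whose adjusted count is exactly k. The function name `modified_kn_discounts` and the counts below are hypothetical and are not taken from KenLM or from the issue.

```python
def modified_kn_discounts(n):
    """Return the discounts D1, D2, D3+ from count-of-counts.

    n maps an adjusted count k (1..4) to the number of n-grams
    with exactly that adjusted count (assumed meaning of s.n[k]).
    """
    # Y = n1 / (n1 + 2 * n2)
    y = n[1] / (n[1] + 2 * n[2])
    # D_k = k - (k + 1) * Y * n_{k+1} / n_k, for k = 1, 2, 3
    return {k: k - (k + 1) * y * n[k + 1] / n[k] for k in (1, 2, 3)}


# Made-up count-of-counts, for illustration only.
reference = modified_kn_discounts({1: 10, 2: 5, 3: 2, 4: 1})
# The same statistics with n[1] off by one.
perturbed = modified_kn_discounts({1: 11, 2: 5, 3: 2, 4: 1})

print(reference)
print(perturbed)
```

Because Y depends on both n1 and n2, an error in either value changes every discount, which in turn changes the probabilities and backoffs written to the final LM.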