Florian Metze comments

Results 92 comments of


                                            Florian Metze

Eesen for Handwriting Recognition

Hi, absolutely, you should be able to use Eesen for handwriting recognition. The main difference between the “char” and the “phn” scripts is how the lexicon is being generated, the...

Eesen for Handwriting Recognition

With the normal scripts, you provide the decoding script (which calls decode-faster) the model directory, the search graph directory, and the test data directory. It will compute the likelihoods using...

Difference between decode-faster vs. latgen-faster

Hi, in most cases lattice based decoding will improve results, it gives better time alignments and allows you to specify a word insertion penalty. It will also give you word...

Missing labels in training / decoding in tf_clean branch

Kalpesh, Ramon would know best about the “v1-tf” recipe, but I can see that there is an error message that says "Can't open data/local/dict_phn/lexicon1.txt: No such file or directory at...

Missing labels in training / decoding in tf_clean branch

Good, not sure about the pickle error, but if you say it does not affect the training, then things should be fine. You should be fine running the test script...

Error Decoding : Tedlium example

Can you describe where exactly this occurs, specifically which branch and which script? Fundamentally, we have 46 phones, but 47 CTC outputs, because index 0 is the blank. The counts...

Error Decoding : Tedlium example

Which branch are you seeing this problem with?

Implementing RNN training on TIMIT database

You will need to provide 48 phones during decoding, I believe, which means 49 symbols including blank. You should be able to generate the priors using the standard call in...

Implementing RNN training on TIMIT database

Are the analyze-counts executables of Kaldi and Eesen different? All the count files that I saw from Eesen contain integer numbers, and the first number is very large, because it...

Implementing RNN training on TIMIT database

tokens.fst contains the epsilon "token" as well as the disambiguation symbols (#1, #2, ...) which you will need to insert into the FST to be able to make words unique....