flashlight
flashlight copied to clipboard
Different log base for scoring calculation with kenlm
Question
The log prob score in KenLM is caculated by log_10. (base is 10) https://github.com/kpu/kenlm/issues/191#issuecomment-439011400
However, the decode processes (Building the Trie tree) use log_e (base is e -> natural logarithm) for most case calculations, such as TrieLogAdd in Trie.cpp . https://github.com/flashlight/flashlight/blob/fa118d5f3c4bcde0022855ea887b46d27bd60304/flashlight/lib/text/decoder/Trie.cpp#L65 Is it make incorrect scores?