kenlm
kenlm copied to clipboard
KenLM: Faster and Smaller Language Model Queries
kenlm is available as a port in [vcpkg](https://github.com/microsoft/vcpkg), a C++ library manager that simplifies installation for kenlm and other project dependencies. Documenting the install process here will help users get...
Hi, I see that the collapse values can be positive, and when loading the model using kenlm.Model, I get a complaint that there are positive log probs. Is there a...
Fix build for Windows
Has anyone tried compiling this to WASM? Is it at all on the roadmap?
Hello, Thanks a lot for this amazing library. It seems like more and more speech libraries are moving to Python, *e.g.*: - Fairseq: https://github.com/pytorch/fairseq - Transformers: https://github.com/huggingface/transformers - Espnet: https://github.com/espnet/espnet...
cmake version = 3.21.3 (on linux) Problems l am facing: 1- first error when running: python -m pip install . --install-option="--max_order 7": Running setup.py install for kenlm ... error ERROR:...
I have tried those commands for converting both to intermediate state to start interpolation of both language models ``` bin/lmplz -o 3 --intermediate set1.intermediate -7.998156 0 > -7.995031 5æfkÕROc¬ÇJáЯ:0mJWIB#N2Ú?/CÞ|pMFÖõš!uÃôq0thÜv7×fŒŸÔa+z¥Ãp\[ÖD£3ò~i8Íâ_`JBO...
in [python/example.py](https://github.com/kpu/kenlm/blob/master/python/example.py) ```python #Stateful query state = kenlm.State() state2 = kenlm.State() #Use as context. If you don't want , use model.NullContextWrite(state). model.BeginSentenceWrite(state) accum = 0.0 accum += model.BaseScore(state, "a", state2)...
Hi, Thank you for all your work and shares! I'm trying to use kenlm to make some rule based language models, on small texts to detect child text missreading. I...
Why is this happening? python3 generate_lm.py --input_txt data.txt --output_dir . --top_k 2 --kenlm_bins /mnt/c/Users/eliso/speech2text/STT/kenlm/build/bin/ --arpa_order 5 --max_arpa_memory "85%" --arpa_prune "0|0|1" --binary_a_bits 255 --binary_q_bits 8 --binary_type trie Converting to lowercase and...