lingua icon indicating copy to clipboard operation
lingua copied to clipboard

Option: Other

Open thsm-kb opened this issue 1 year ago • 3 comments

Great tool - thank you! Suggestion: The possibility to add OTHER as a language. Lets say I want to find English and French in a multi-language set. I want to add English and French to LanguageDetectorBuilder.from_languages, but if the probability is low, I don't want everything to be marked as English or French, but something else -> Other.

thsm-kb avatar Nov 09 '22 08:11 thsm-kb

Thanks for the suggestion @thsm-kb. In fact, I was already thinking about changing the detector's behavior in this way. I will most probably implement something like this. So please stay tuned.

pemistahl avatar Nov 15 '22 08:11 pemistahl

Any updates on this @pemistahl ? I'm currently searching for something to inspect if a website is written in Danish or not. I would love to use Lingua since it was a great experience last I used it.

thsm-kb avatar Oct 31 '23 08:10 thsm-kb

No, there is no progress yet. I maintain four implementations of Lingua, currently I'm writing Python bindings for the Rust implementation. If there is progress, you will see it in the release notes.

pemistahl avatar Oct 31 '23 09:10 pemistahl