lingua-py icon indicating copy to clipboard operation
lingua-py copied to clipboard

Luxembourgish

Open astuanax opened this issue 11 months ago • 6 comments

Would it be possible to include Luxembourgish?

I believe 2 EU langauges are missing from the list: Maltese and Luxembourgish.

It seems Thierry Goeckel already build luxdetection, but maybe we can integrate this? https://github.com/rotzbouw/luxdetect

Would be happy to discuss how to go forward and help out.

astuanax avatar Jul 28 '23 08:07 astuanax

Hi @astuanax, thanks for your request.

I'm planning to add 25 more languages to Lingua so that it supports a total of 100 languages then. I'm pretty sure that Maltese and Luxembourgish will be among those new languages. It may take a while, however.

Before starting that, I will first evaluate whether it's possible to use the Rust port of Lingua within Python because the pure Python port is actually very slow. The Rust port is significantly faster.

pemistahl avatar Aug 03 '23 08:08 pemistahl

Sure, I understand, let me know if I can help with testing.

astuanax avatar Aug 09 '23 13:08 astuanax

@pemistahl can ML libraries accelerate Python's performance?

TomLucidor avatar Nov 03 '23 07:11 TomLucidor

@TomLucidor I'm currently writing Python bindings for the Rust implementation which will eventually replace the pure Python implementation. This will solve most performance issues.

pemistahl avatar Nov 03 '23 12:11 pemistahl

Hello @pemistahl will there be Occitan and Kabyle languages in your 100 new supported languages? Best regards

Mejans avatar Nov 06 '23 17:11 Mejans

Hi @Mejans, I won't add a set of 100 new languages. I was talking about 25 new languages. That's far enough work for now.

I haven't decided yet which languages to include but I'm in favor of including some minority languages as well. So thank you for proposing Occitan and Kabyle. I will keep them in mind.

pemistahl avatar Nov 07 '23 19:11 pemistahl