franc icon indicating copy to clipboard operation
franc copied to clipboard

Natural language detection

Results 4 franc issues
Sort by recently updated
recently updated
newest added

I'd like to play with patching franc, or making some alternative to it, that can detect the language of small documents much more accurately. First of all is this something...

`Что это за язык?` is a Russian sentence, which is detected as Bulgarian (bul 1, rus 0.938953488372093, mkd 0.9353197674418605). However, neither Bulgarian nor Macedonian have the letters э and ы...

Currently franc to me often returns a probability close to 1 for many languages, IMO all these probabilities should be normalized to add up to 1. Also there seems to...

## sentence 1 特別推薦的必訪店家「ヤマシロヤ」,雖然不在阿美橫町上,但就位於JR上野站廣小路口對面 ``` jpn 1 google translate result is Chinese correctly ``` ## sentence 2 特別推薦的必訪店家,雖然不在阿美橫町上,但就位於JR上野站廣小路口對面 ``` cmn 1 google translate result is Chinese correctly ``` Sentence 1...