whatlanggo
Detection problem for short text / training option
Hi,
Hope you are all well!
I am having trouble detecting French in short sentences like the ones below.
| Sentence | Language Detected | Real Language | Location |
|---|---|---|---|
| Ras. | Esperanto | French | France |
| RAS bon. | Esperanto | French | France |
| PAS DE SOUCI. | Portuguese | French | France |
| Bien. | Spanish | French | France |
| RIEN A SIGNALER. | Spanish | French | France |
| Nickel. | Polish | French | France |
| Pas assez de recul. | Portuguese | French | France |
| Je recommande. | Dutch | French | France |
Is there a way to train the model with additional patterns/sentences in order to improve detection confidence?
By the way, I know where these sentences come from (they are all from France). Is there a way to influence the score with an additional parameter such as the location?
Thanks in advance for any insights or solutions!
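To make the location idea concrete, here is a minimal sketch of the kind of re-weighting I have in mind. It does not use whatlanggo at all: the `Candidate` type, the `rerankWithPrior` function, and the prior values are all hypothetical, and it assumes the detector could expose a score per candidate language, which I have not checked.

```go
package main

import "sort"

// Candidate is a hypothetical (language, score) pair standing in for
// whatever per-language scores a detector computes internally.
type Candidate struct {
	Lang  string  // ISO 639-3 code, e.g. "fra"
	Score float64 // higher means more likely
}

// rerankWithPrior multiplies each candidate's score by a location-based
// prior and returns a new slice sorted by adjusted score, best first.
// Languages missing from the prior map get defaultPrior.
// The input slice is left unmodified.
func rerankWithPrior(cands []Candidate, prior map[string]float64, defaultPrior float64) []Candidate {
	out := make([]Candidate, len(cands))
	for i, c := range cands {
		p, ok := prior[c.Lang]
		if !ok {
			p = defaultPrior
		}
		out[i] = Candidate{Lang: c.Lang, Score: c.Score * p}
	}
	// Stable sort keeps the detector's original order for equal scores.
	sort.SliceStable(out, func(i, j int) bool {
		return out[i].Score > out[j].Score
	})
	return out
}
```

With a prior that strongly favours French for text known to come from France, a near miss such as Esperanto narrowly ahead of French would flip to French, while a clear win for another language would still stand.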
Cheers, X