whatlanggo icon indicating copy to clipboard operation
whatlanggo copied to clipboard

detection problem for short text / training option

Open ghost opened this issue 4 years ago • 0 comments

Hi,

Hope you are all well !

I have a problem to detect french language on short sentences like the one below.

Sentence Language Detected Real Language Location
Ras. Esperanto French France
RAS bon. Esperanto French France
PAS DE SOUCI. Portuguese French France
Bien. Spanish French France
RIEN A SIGNALER. Spanish French France
Nickel. Polish French France
Pas assez de recul. Portuguese French France
Je recommande. Dutch French France

Is there a way to train the model with additional patterns/sentences in order to improve detection confidence ?

Btw, I know the location of these sentence, like they are all from France, is there a way to influence the score with an additional parameter like the location ?

Thanks in advance for any insights or solutions !

Cheers, X

ghost avatar Mar 23 '20 05:03 ghost