mouse-dictionary icon indicating copy to clipboard operation
mouse-dictionary copied to clipboard

Is there a way to recognize characters other than Japanese and the alphabet?

Open yamadayo opened this issue 5 years ago • 8 comments

Is there a way to recognize characters other than Japanese and the alphabet?

yamadayo avatar Dec 14 '20 03:12 yamadayo

To what language do you want to use?

I have a plan to develop a feature that enables users to inject user-customized language processer, so I want to know your use case.

wtetsu avatar Dec 16 '20 13:12 wtetsu

chinese

yamadayo avatar Dec 17 '20 01:12 yamadayo

@wtetsu I want to use it for eng -> turkish, is this only eng -> jp?

ghost avatar Dec 19 '20 17:12 ghost

@toprak En -> Any language would work perfectly. However, you have to find or make dictionary data and import it.

wtetsu avatar Dec 19 '20 18:12 wtetsu

Is it not necessary to rewrite the program just by importing the dictionary data?

yamadayo avatar Dec 23 '20 06:12 yamadayo

@yamadayo Please try it out. Maybe, it's not smart like English/Japanese parsing, but partially work.

If you need smarter parser for other languages, program improvement is required.

wtetsu avatar Dec 25 '20 15:12 wtetsu

Greetings. I've tried Spanish and didn't work at the accent alphabets e.g. "programación"

The example dataset:

{
    "lenguaje": "language as in choice of words\nprogramming language",
    "lenguaje de programación": "A programming language",
    "lengua materna": "mother tongue"
}

Screenshot:

ShareX_ScreenShot_4ddd57e5-0eb9-4ad6-950c-2945aa4dc5b8

I would suggest a workaround something like enabling user to map unsupported characters e.g. "ó" -> "o", "ñ" -> "n";

yuis-ice avatar Jan 28 '22 14:01 yuis-ice

@yuis-ice Thank you for the information. Let me consider it.

wtetsu avatar Jan 29 '22 05:01 wtetsu