Sudachi
Sudachi copied to clipboard
User dictionary source File Creation from Token and POS mapped file
if there any Utility to generate the “User dictionary source File” from a raw file ,which has Sentence and its Tokens and POS Mapping for Each Token . I mean if we have Token and POS mapping , if there any easy way to generate the “User dictionary source File”
For Example , if we have a file as below , or any similar format, can we generate the “User dictionary source File”
Do you want to implement an analyzer for Thai(?) only, or use it to analyzer mixed Thai(?)-Japanese data?
I want only for Japanese , the Earlier Example what I given by mistake Thai language , below is the Japanese Example . i can have in any format , but i can have below parameter
- Raw Sentence and its Tokens and POS mapping