tokenmonster icon indicating copy to clipboard operation
tokenmonster copied to clipboard

Update on multilingual

Open kerighan opened this issue 1 year ago • 2 comments

Is there any update on the multilingual tokenizers? The project seems to be on pause.

kerighan avatar Feb 27 '24 11:02 kerighan

You can get the binary/compile from source to train your own, I think the scope of the project is pretty good for production.

nampdn avatar Apr 26 '24 03:04 nampdn

hi @nampdn would please guide me how can i do that for bangla language , I am technical but newbie in core NLP domain. Help will be much appreciated

asifshaikat avatar Apr 30 '24 06:04 asifshaikat