AdvancedLiterateMachinery
AdvancedLiterateMachinery copied to clipboard
Can MGP-STR deal with Chinese text? Can I train MGP-STR with Huggingface version?
trafficstars
Hi AlibabaResearch,
I have few questions listed as below:
- Can MGP-STR deal with Chinese text with code in GitHub or Huggingface?
- Can I train MGP-STR with Huggingface version?
Thanks.
Hi, Currently, MGP-STR is unable to process Chinese as the model has not been trained on Chinese data, and we have not found an effective method for segmenting Chinese words. If you have discovered one, we welcome the exchange of ideas.
The version on Huggingface is only capable of inference; for training purposes, you may refer to the instructions provided on GitHub.