indic_tagger icon indicating copy to clipboard operation
indic_tagger copied to clipboard

Support for Marathi POS Tagging encoded in WX format

Open AdityaYadavalli1 opened this issue 4 years ago • 1 comments

Currently, the support for CRF-based Marathi WX POS tagging seems to be broken. The following is the traceback I get.

Traceback (most recent call last):
  File "pipeline.py", line 203, in <module>
    pipeline()
  File "pipeline.py", line 184, in pipeline
    tagger.load_model()
  File "/home/asr/indic_tagger/tagger/src/algorithm/CRF.py", line 25, in load_model
    self.tagger.open(self.model_path)
  File "pycrfsuite/_pycrfsuite.pyx", line 571, in pycrfsuite._pycrfsuite.Tagger.open
  File "pycrfsuite/_pycrfsuite.pyx", line 733, in pycrfsuite._pycrfsuite.Tagger._check_model
IsADirectoryError: [Errno 21] Is a directory: '/home/asr/indic_tagger/models/mr/crf.pos.wx.model'

Basically, I think the model that is trained on WX text is missing. Please let me know when you can provide support for this.

AdityaYadavalli1 avatar Sep 29 '21 10:09 AdityaYadavalli1