Patrick J. Burns
@PhVerkerk I have updated the installation instructions and dependencies (thanks @todd-cook and PR #5), and also added a note that LatinBERT is tested on Python...
Thank you for trying the new setup—can you let me know what version of rustc you are running? (I have installed so far with 1.50.0 and 1.58.1.)
The Ensemble Lemmatizer will return the likelihood of possible lemmas depending on which of the sublemmatizers is used (e.g. frequency for training-data-based, i.e. EnsembleUnigramLemmatizer). Here is the example included in...
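To illustrate the frequency-based case, a minimal sketch in plain Python follows. It is not the example referred to above and does not use the CLTK API: the function names (`train_unigram_scores`, `lemmatize_with_scores`) and the toy training pairs are hypothetical, and the score is assumed to be simply the relative frequency of each lemma for a token in the training data.

```python
from collections import Counter, defaultdict


def train_unigram_scores(pairs):
    """Map each token to {lemma: relative frequency} from (token, lemma) pairs.

    Hypothetical helper for illustration; not part of any library.
    """
    counts = defaultdict(Counter)
    for token, lemma in pairs:
        counts[token][lemma] += 1
    scores = {}
    for token, lemma_counts in counts.items():
        total = sum(lemma_counts.values())
        scores[token] = {lemma: n / total for lemma, n in lemma_counts.items()}
    return scores


def lemmatize_with_scores(tokens, scores):
    """Return (token, [(lemma, score), ...]) with candidates sorted by score.

    Tokens unseen in training get an empty candidate list, which is where a
    further sublemmatizer in an ensemble would take over.
    """
    result = []
    for token in tokens:
        candidates = scores.get(token, {})
        ranked = sorted(candidates.items(), key=lambda kv: (-kv[1], kv[0]))
        result.append((token, ranked))
    return result


# Toy training data (assumed for illustration only).
TRAINING_PAIRS = [
    ("est", "sum"),
    ("est", "sum"),
    ("est", "sum"),
    ("est", "edo"),
    ("arma", "arma"),
]

scores = train_unigram_scores(TRAINING_PAIRS)
result = lemmatize_with_scores(["arma", "est", "virumque"], scores)
```

In this sketch an ambiguous token such as `est` keeps every lemma seen in training, each with its share of the occurrences, so the caller can decide whether to take the top candidate or pass the full list on for further disambiguation.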
@todd-cook If helpful, I could write a sublemmatizer that uses POS info in addition to token/lemma info.
> Of course a more robust metric will probably use embeddings and context to pull the lemma forms in the right direction. @todd-cook Been meaning to implement this as well—perhaps...
Fixed by #1144?
Had just such an application for this today—if I figure out the best way to handle this, I'll submit a PR.
Happy to hear that it is working—sorry it was as difficult as it was. I will review this thread and see if the installation instructions can be made clearer. I...
> What do you think of the suggestion @diyclassics? @clemsciences Thanks for providing the custom class—happy to consider whatever features are useful to the community. The BackoffLemmatizer was always meant...
Adding #4 (re: homographs) to this thread as well...