Jackson L. Lee

Results 21 comments of Jackson L. Lee

> @jacksonllee should we do a new PyPI release? Yeah, we should. It looks like there's a couple packaging / maintenance things we should update as well. Let me put...

Hello, apologies for the long silence. TL;DR -- I've looked into the Colaje corpus data. I've also just made a new release of pylangacq (v0.19.0) for some coincidentally related reasons...

> 之後可能會開個 PR 睇下點優化。 I'd recommend discussing what you guys have in mind before opening a PR. I've had a strong preference for (i) using only datasets and models that...

So you guys are talking about `pycantonese.pos_tag` but not `pycantonese.segment`? I got confused by [how this issue was first raised](https://github.com/jacksonllee/pycantonese/issues/33#issue-1395612893). > It seems like _AveragedPerceptron.predict() is essentially doing matrix multiplication...

> 唔好意思,我一直都係講 .segment()唔係.pos_tag() I've just made a new release of `wordseg` to speed up word segmentation, thanks to @ZhanruiLiang's tip. `pycantonese.segment` should now be a couple times faster than before....

Thanks for the note, @graphemecluster! I've just made a new release of `wordseg` (my own word segmentation package that pycantonese uses) to speed up word segmentation, thanks to @ZhanruiLiang's tip....

@ZhanruiLiang Thank you for looking into possibly using numpy to speed up POS tagging. Re: #34, ~I'll take a look and keep you posted~ EDIT: #34 has been resolved. Thank...

Both #32 and #34 have been resolved. @ZhanruiLiang The upstream `main` branch is now ready if you're still up for a pull request with your improved POS tagger code.

Circling back here -- The `wikipron` package version 1.3.0 has just been released to PyPI: https://pypi.org/project/wikipron/1.3.0/

Hello @milamarcheva, thank you for making this pull request! I haven't thought about how or whether to handle words annotated with a preceding 0, so this is a good opportunity...