spacyr icon indicating copy to clipboard operation
spacyr copied to clipboard

Matching a pattern for dependencies

Open mitramir55 opened this issue 4 years ago • 4 comments

Hi everyone. Recently, I built a model that can detect passive voice in the records of a dataset using SpaCy dependency parsing, rule-based matcher, and nlp.pipe(). Here's the code for the project for more clarity.

Now, my question is, is there any way I can use Spacy matcher for this task in R using SpaCyr? I couldn't find any specific thing myself but I think there may be some ways I can construct something with a similar functionality myself.

I also could make use of n_processes in nlp.pipe() function as the dataset I'm working with is quit large. So if anyone has any idea about this too I'd love to hear it!

Thanks in advance!

mitramir55 avatar Jan 19 '21 13:01 mitramir55

Thanks for sharing that. We could pass through arguments to Matcher(). @amatsuo what do you think?

Also as a separate issue, using n_processes sounds like the answer to a bottleneck we've noticed for some time too: large corpora tend to be hard to process all at once.

kbenoit avatar Jan 20 '21 15:01 kbenoit

Hi @kbenoit and @amatsuo, any updates on these functions? I'm willing to contribute to expanding and building the tools if you needed any help.

mitramir55 avatar Oct 31 '21 18:10 mitramir55

Just picking this up - sorry it took so long - but @mitramir55 if you wanted to contribute any code for this, that would be great. spacyr needs some attention generally in light of updates to spaCy and new developments in the reticulate package.

kbenoit avatar Jan 05 '22 12:01 kbenoit

We will keep this open and tag it as Wishlist.

kbenoit avatar Sep 01 '22 09:09 kbenoit