Matthew Honnibal

Results 13 issues of Matthew Honnibal

I find this very interesting. My main project, the spaCy NLP pipeline, is written in Cython. Cython's great and terrible — it's a messy language, but it allows a lot....

We should consider adding a Pathy class that can support http/https endpoints. It wouldn't be able to do all operations, but we could do some, right? * `path.open()`: This could...

Hi, Thanks for the project! Like you I've been very dismayed by the state of the data being published. I don't understand why we can't just have a CSV with...

feedback

The workflow for setting up a pipeline component in code sometimes feels a bit rough. This came up while I was investigating #6958. Let's say we have some pipeline component...

enhancement
feat / pipeline
feat / ux

Hey, First, thanks for the kind words in various places :). I came across your posts, which led me here. I also spent quite some time working on similarity models....

There are lots of ways to write Cython. I normally suggest writing the optimised function with a pure C interface, declared "nogil". The nogil declaration tells the Cython compiler there...

The 'direct' option in 'spacy download' is supposed to only download from our model releases repository. However, users were able to pass in a relative path, allowing download from arbitrary...

enhancement
feat / cli

Add a method `Language.add_pipe_instance` that allows you to add component instances to the pipeline. This is primarily intended to make it easier to assemble a pipeline dynamically in a script,...

enhancement
feat / pipeline

I'd like to manipulate a `BatchEncoding` object returned from the `transformers` library, by dropping some of the tokens from it. Currently my plan is to compute a mask for the...

Stale

Experiment with using cibuildwheel directly