enable more robust multiple dispatch with `plum`
This PR is an experiment that builds fastcore.transform with the same interface (all tests passing) on top of plum instead of fastcore.dispatch. The ideal outcome of this would be a "yes/no" decision on whether to adopt plum.
It was mostly seamless except for:
- The complexity in `_mk_plum_func`. I don't think the library was designed for dynamically adding `plum.Function`s to classes after the class body is evaluated. The workaround isn't too hairy though.
- I couldn't find a good way to get plum's type conversion system to match the desired `Transform` interface, which is to automatically convert the result of a tfm method call to the same type as the input, so it's still relying on `fastcore.dispatch.retain_type` (see the sketch below).
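For reference, here's a minimal sketch of the behaviour that `retain_type` provides (`retain_type` is the real fastcore function; `MyStr` is just an illustrative subclass):

```python
from fastcore.dispatch import retain_type

class MyStr(str): pass

# A tfm method that receives a `MyStr` but returns a plain `str` has its
# result cast back to the input's type:
res = retain_type("hello", old=MyStr("x"))
assert type(res) is MyStr
```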
Other points worth noting:
- I noticed that there aren't many code comments in these parts of the repo. I've added some here to guide the review process, but happy to remove them if preferred.
- `fastcore.dispatch`'s convention was to use tuples of types as unions; plum's is to use `Union`. E.g. `(str,list)` becomes `Union[str,list]`.
- plum searches parent classes by default, which is great.
- I replaced dynamic creation of tfm methods from `_TfmMeta.__new__` with methods on `Transform`. I think this is simpler and haven't found a downside yet.
- Dispatching classmethods, staticmethods, and normal methods requires a specific order (https://github.com/wesselb/plum/issues/35).
- plum does not work with future annotations (https://github.com/wesselb/plum/issues/41), though the author has said that it shouldn't be too difficult a fix.
- plum does not work with autoreload (https://github.com/wesselb/plum/issues/38).
It's also worth considering whether plum enables new interfaces that would've otherwise been very difficult. For example, it supports parametric classes and hooking into its type inference system. I haven't given this much thought yet.
Many thanks for looking into this! I think the prototype looks encouraging.
Could you please take a look at the CI failures? The fastai integration tests are failing. If you clone the fastai repo and do an editable install of it, you can then run `make test` in that repo to run the tests directly.
I noticed that there aren't many code comments in these parts of the repo. I've added some here to guide the review process, but happy to remove them if preferred.
Feel free to add comments anywhere that the code wasn't that clear, such that future readers won't have to do the digging that you did.
`fastcore.dispatch`'s convention was to use tuples of types as unions; plum's is to use `Union`. E.g. `(str,list)` becomes `Union[str,list]`.
We're moving to unions too, so that's fine. We always write them as `str|list` however, and use future annotations.
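To make the three spellings concrete (a minimal sketch; only the annotations matter here):

```python
from __future__ import annotations  # makes `str|list` legal on Python < 3.10

from typing import Union

def f(x: (str, list)): ...       # fastcore.dispatch's tuple convention
def g(x: Union[str, list]): ...  # plum's/typing's convention
def h(x: str|list): ...          # fastai house style, via future annotations
```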
I replaced dynamic creation of tfm methods from `_TfmMeta.__new__` with methods on `Transform`
Great. I'd always prefer avoiding metaclasses where possible.
plum does not work with future annotations
That's a blocker, since we use them a lot. I'll set this PR to draft since we can't merge it until there's a release of plum with that fixed.
plum does not work with autoreload
That would be really nice to fix too.
Thanks so much for the response!
I took a look at the integration test logs and it looks like it’s the clash with future annotations 🥲
Will try to see how hard it’d be to fix that as well as the autoreload issue, and try to get in touch with plum’s maintainer too.
@jph00 a few updates on progress here:
All fastai tests are now passing locally 🚀! (Except 20a_distributed which fails with OSError: [Errno 24] Too many open files. I suspect this is unrelated but I'll ask for help on Discord.)
This is thanks to plum now supporting future annotations! 🎉 @wesselb has been very responsive and supportive throughout (@wesselb - just wanted to say thanks, but please feel free to unsubscribe if you like). See this issue for details. There was also a small bug with operator.{attr,item}getter which was fixed very quickly as well.
Next steps
Atm this PR requires changes to the fastai repo, specifically to handle the convention of tuple annotations representing unions. @jph00 do you think it's worth fastcore supporting the tuple convention for a smoother release flow - or is it better to do concurrent branches/PRs on fastcore and fastai? In any case, I'll make the PR on the fastai repo next.
After having worked a bit more with these libs, I think I can clean up the current implementation some more. I also have a few more test cases I want to add to fastcore from the plum future annotations thread - fastai uses Transforms and dispatch in a few very interesting ways! This branch currently also has a bunch of TODOs hanging around.
There is one outstanding issue: `_TfmMeta` manually sets `Transform.__signature__`, which breaks the ability to dispatch a `partial(Transform)`. I have a few ideas here.
There is ongoing discussion about getting plum to work with autoreload here. I suspect it might not affect our use-cases much but I haven't really dug into it yet. (We manually define plum.Functions for the tricky Transform behaviours. This issue mostly affects the dispatch tables used by a global plum.Dispatcher on class functions.)
Hi @seeM I'm a plum contributor.
I'm chirping in because I saw this thread and noticed that you mentioned
I couldn't find a good way to get plum's type conversion system to match the desired Transform interface, which is to automatically convert the result of a tfm method call to the same type as the input, so it's still relying on fastcore.dispatch.retain_type.
Have you seen how return-type annotations work in plum? It seems to me they would solve this issue. See for example:
```
In [1]: import plum

In [2]: @plum.dispatch
   ...: def test(a: int) -> float:
   ...:     return 2*a
   ...:

In [3]: plum.add_conversion_method(type_from=int, type_to=float, f=float)

In [4]: test(1)
Out[4]: 2.0
```
Hi @PhilipVinc, thanks for the response! I’m not at a PC atm so can’t share any code examples just yet
Yep, we’re using add_conversion_method atm. What I was referring to was that, if you create a fastcore.Transform from a function without a return annotation (a Transform can roughly be thought of as a plum.Function here), the result of the Transform call will be converted to the type of the first argument.
I don’t think there’s a neat way to configure plum’s type conversion system to achieve that.
I’m not even sure what’d be a good interface for that just yet 😄 perhaps to be able to provide a custom default convert function to a Dispatcher or Function? I’d like to give this some more thought. Does this make sense?
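Roughly this kind of hook, sketched from memory (none of this is a real plum API; `RetainTypeFunction` is a made-up name):

```python
from plum import Function

class RetainTypeFunction(Function):
    "Hypothetical: convert results back to the first argument's type, like fastcore's `retain_type`"
    def __call__(self, *args, **kwargs):
        res = super().__call__(*args, **kwargs)
        # If the input's type is a subclass of the result's type, cast the
        # result back to the input's type:
        if args and isinstance(args[0], type(res)) and type(res) is not type(args[0]):
            res = type(args[0])(res)
        return res
```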
Ah I see. In Julia's language (which is what heavily inspired plum), what you are trying to have would be

```julia
fun(x::T, y, z)::T where {T}
```
Type parameters are in general unsupported by plum, but I would love to see them supported.
I think it would be quite a bit of work, however, to get them working. Probably using the recently introduced TypeVar machinery in typing, which is there for a similar necessity, would be a good starting point... But indeed, someone needs to put in the work for that.
Ahhh yes that’s exactly it - can’t believe I missed that! Python does indeed support type variables now. I’d be keen to poke around plum to see what it would take to support them, but agree that’s definitely a longer term project.
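For reference, that Julia signature spelled with `typing`'s type variables (illustrative only - plum doesn't dispatch on these yet):

```python
from typing import TypeVar

T = TypeVar("T")

# Julia's `fun(x::T, y, z)::T where {T}`:
def fun(x: T, y, z) -> T: ...
```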
Just briefly commenting that experimental support for autoreload has been included in the latest release v1.6 thanks to #44 by @PhilipVinc.
On the topic of type parameters, it would be super awesome to have those included. I'm currently time constrained because I'm submitting my dissertation in the next month (ish), but I've planned a major overhaul of Plum's internals for after that to fix some longstanding suboptimal design choices made early on. If at all possible, I'd be willing to look into supporting type parameters in a perhaps initially limited extent.
This is amazingly great progress, and so cool to see this wonderful collaboration bearing fruit!
Agreed, many thanks to @wesselb and @PhilipVinc for the amazing contributions 😄 it's been a privilege working with fastcore and plum!
@jph00 this is now a drop-in replacement supporting all existing functionality, and ready for the GitHub action to run and for review if that passes (fastcore and fastai tests pass locally). It is still of course a prototype, so keen to hear your thoughts
OK running the CI now - nice job on getting to this point so quickly!
@jph00 https://github.com/fastai/fastcore/pull/415/commits/fce1adb2c6c42af1d9c0a005bfc83a5b820e9a51 fixes the Python 3.7 incompatibility that broke CI. Tests pass locally on Python 3.7
I refactored the title and added a label to this PR so that when/if it is merged it will show up in the Change Log correctly
@jph00 friendly reminder this PR is ready for review
We chatted on Discord about extending this prototype to use plum everywhere, not just for Transform. I'm still working on that. I'll make this a draft PR again. Sorry for the confusion!
@hamelsmu @jph00 this is ready for review. I suggest reviewing #424 and #425 first.
@seeM there are some conflicts now, looks like you might want to merge master into this branch etc.
@hamelsmu Done!
Amazing job @seeM :D
So, now that you've been on this journey, what do you think? Are there improvements in using plum over our home-made approach? Are there downsides?
There's still quite a bit of code here, despite the heavy lifting being pushed out to plum. Are there changes we could/should make to our interface to take better advantage of plum, and move closer to their API, rather than replicating our existing API?
Something to think about too: fastcore now has its first external dependency. Should we perhaps move this into a new project called fastdispatch?
@jph00, TL;DR: Switching to plum would be a bet that multiple dispatch could lead to powerful use-cases in future. It wouldn't improve much today. Someone with experience using both fastai and multiple dispatch could better understand the likelihood of that bet paying off. Unfortunately I'm not yet in that position, but I'm happy to tinker. I'd love to hear your candid thoughts!
Overview
Pros:
- The biggest improvement is multiple rather than dual dispatch. Whether this is worthwhile depends on how much multiple dispatch could improve fastai. I'm not sure, since I don't have sufficiently deep experience with either fastai or multiple dispatch. Since fastai's current design is heavily class-based, this may also require a large design shift. See Wessel's `lab` ("generic interface for linear algebra backends") for a really neat example of what's possible.
- plum searches base classes by default, without having to use metaclasses as we do.
Cons:
- The biggest downside atm is that we still have lots of code here. I think we can bring a lot of these extensions into plum, ending up with ~20 lines here (see the next section).
- plum uses compiled C code, so we lose parts of the stack trace.
- There are constraints to dispatching over static- and class-methods. See: https://github.com/wesselb/plum/issues/35. This hasn't affected any fastai use-cases.
Details
Bringing these extensions into plum
Here's a rough sketch of the code we'd need if we brought most of our extensions into plum (excluding fastcore's casting functions):
```python
from typing import Union

from plum import Dispatcher, Function

# NB: assumed import locations for these fastcore helpers:
from fastcore.basics import copy_func, type_hints
from fastcore.meta import delegates

def _eval_annotations(f):
    "Evaluate future annotations before passing to plum to support backported union operator `|`"
    f = copy_func(f)
    for k, v in type_hints(f).items(): f.__annotations__[k] = Union[v] if isinstance(v, tuple) else v
    return f

class FastFunction(Function):
    @delegates(Function.dispatch)
    def dispatch(self, f=None, **kwargs): return super().dispatch(_eval_annotations(f), **kwargs)

class FastDispatcher(Dispatcher):
    function_cls = FastFunction  # the proposed hook; plum doesn't currently provide one
    @delegates(Dispatcher.__call__, but='method')
    def __call__(self, f, **kwargs): return super().__call__(_eval_annotations(f), **kwargs)

typedispatch = FastDispatcher()
```
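For illustration, usage of that sketch would look like this (hypothetical example names):

```python
@typedispatch
def f(x: (str, list)): return "str-or-list"  # tuple annotation converted to Union[str, list]

@typedispatch
def f(x: int): return "int"

f("a"), f([]), f(1)  # ('str-or-list', 'str-or-list', 'int')
```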
We currently extend plum as follows:
1. More concise `repr(Function)`. There's already an issue: https://github.com/wesselb/plum/issues/47.
2. Evaluating future annotations before passing to plum to support the backported `|` operator, currently by overriding `Dispatcher.__call__` and `Function.dispatch`. I think that's fine as is.
3. `Function.__getitem__` is a convenient wrapper of `Function.invoke`, which was added to avoid a breaking change for fastai's `show_batch`. It could instead use `invoke` (see the next section).
4. Use a custom `Function` in `Dispatcher` (to support points 1-3). plum doesn't provide a nice hook for this, so we have to bring in the entire `Dispatcher._get_function` and change one symbol.
5. New feature: `Dispatcher.to` for late dispatching to class instance methods (a hypothetical usage shape is sketched below). I suspect this would be a useful feature to bring into plum as well, and that there is a simpler way to implement it if I had a better understanding of plum's internals.
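To illustrate point 5, a hypothetical usage shape for `Dispatcher.to` (`Cat` and `meow` are made-up names):

```python
class Cat: pass

# Register `meow` as a dispatched method on `Cat` after its class body
# has been evaluated:
@typedispatch.to(Cat)
def meow(self, n: int): return "meow " * n

Cat().meow(2)  # 'meow meow '
```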
Using invoke instead of __getitem__
We could use Function.invoke instead of FastFunction.__getitem__. While the latter is convenient, I don't know if it's necessary over the more explicit former. This is currently only used by fastai's show_batch.
That is:
```python
# This
show_batch[object]
# Becomes
show_batch.invoke(object, object, object)
```
The three objects are needed because the base show_batch is defined with three positional non-defaulting parameters, and plum dispatches over all of them.
We could also use keyword-only parameters to indicate which shouldn't be dispatched:
```python
@typedispatch
def show_batch(x, y, *, samples, ctxs=None, max_n=9, **kwargs): pass
```
Then we can do:
```python
show_batch.invoke(object, object)
```
Wow fantastic explanation @seeM , thanks so much.
As a research lab, I think we should be focussing on things that allow us to do interesting research. So this change fits our mission.
Can you provide more detail re "plum uses compiled C code"? I don't see C code in the plum repo.
My suggested path is:
1. Move this PR to instead create new modules in this new repo: https://github.com/fastai/fastdispatch. I've made you an admin. You could use `nbprocess_new` to get it started, and then add a notebook to it with this functionality.
2. Change fastcore's typedispatch to use `invoke` instead of `__getitem__`, and for now put a deprecation warning on `__getitem__` and have it call `invoke`.
3. In a case where there's no type annotation, could plum be changed to not require an explicit `object` in that position when calling `invoke`? That would maintain compatibility with fastcore typedispatch AFAICT, and IMO is a better dev UX.
4. Move stuff from this new fastdispatch to plum for anything that @wesselb is OK with. For other stuff, leave it in fastdispatch.
5. Once the previous step is done as best as we can, change fastai to use fastdispatch. If at this point there's almost no code in fastdispatch, move that code into fastai and delete the fastdispatch repo, adding a dep in fastai to plum (and otherwise we'll make fastai dep on fastdispatch).
How does that all sound? And @wesselb how do you feel about the ideas for stuff that @seeM mentions above?
@jph00, thanks for the kind words and the thoughtful suggestions!
`plum.function` is compiled with Cython[^cython]. I'm not sure how much of a performance boost this gives fastai, since it's a high-level library, or whether it's worth losing inspect support and stack frames. We might want to turn it off if possible.
I'm happy to move forward as you suggested. I have a few questions though:
- I'm a bit confused about the dependency graph. Would both `fastcore` and `fastai` depend directly on `fastdispatch`? Since `fastcore.transform` depends on `fastcore.dispatch`, and `fastai` depends on both of those.
- Is the idea to leave the existing dispatch implementation in `fastcore` for now?
- Do you mean "change ~~fastcore~~ fastai's usage of typedispatch to use `invoke` instead of `__getitem__`"? The only current use-case of `__getitem__` is in `show_batch` in `fastai`.
- I'll need to give some more thought to your `invoke` suggestion. I'm finding it tricky to think through its ramifications. There was an interesting discussion about removing `invoke` from Julia. Jeff Bezanson (co-creator of Julia) argued that most uses of `invoke` could be better addressed by having another function with its own name - in our case, `show_batch_default`. I quite like that too. The proposal fell through though.
[^cython]: `plum.function` is included as an extension in `setup.py`. setuptools checks whether Cython is installed and, if so, uses it to build extensions. Since Cython is also included as a build dependency in `pyproject.toml`, that'll always be the case.
Oh I see - it's not really using any of the typed features of cython, but just doing some AOT compilation with it. I don't think that's of any benefit to fastai, so if there's an easy way to turn it off, I think that's a bit better.
I'm a bit confused about the dependency graph. Would both fastcore and fastai depend directly on fastdispatch? Since fastcore.transform depends on fastcore.dispatch, and fastai depends on both of those.
fastcore.transform would be moved to fastai. That's where it used to be.
Is the idea to leave the existing dispatch implementation in fastcore for now?
Yes, and give it a deprecation warning.
Do you mean "change fastai's usage of typedispatch to use invoke instead of getitem"? The only current use-case of getitem is in show_batch in fastai.
Yes.
BTW I wonder if it would be worth adding a "plum-purepython" package to pypi and conda -- one that doesn't use C extensions? That would be a bit easier to deal with IMO.
@jph00 Sorry, I totally missed your reply!
Double-checking that my understanding is correct:
- [x] This PR's code moves to `fastdispatch`
- [ ] As much of `fastdispatch`'s functionality as @wesselb is OK with is brought into `plum`
- [ ] `fastcore.transform` moves to `fastai`, refactored to use `fastdispatch`
- [ ] `fastcore.dispatch` remains, but raises a deprecation warning (perhaps pointing to `fastdispatch`)
- [ ] `fastcore` would still have no external dependencies
I like the plum-purepython idea. I'll move that discussion to plum's issue tracker.
Shall we close this PR then?
That's all correct.
In addition, I think we should aim to eventually get rid of fastdispatch, if possible, by moving all the general stuff to plum, and everything else to fastai.
I thought of a better idea than creating a "plum-purepython". Instead, I think that Cython should simply be removed from the deps in plum's `pyproject.toml`. That way, no dep on an installed compiler toolchain is required, but for people that have it, it'll be used. (Or at least, that's my understanding...)
Hey @jph00! Just a quick clarification that all Cython compilation has since been removed from Plum entirely, since the gains were only minor. Plum is now a pure-Python package and doesn’t have Cython as a dependency anymore. It would, however, definitely be possible to reintroduce compilation whenever the user happens to have Cython installed.
Thanks @wesselb - I've reopened this now. We're currently focused on the latest fast.ai course, but we'll come back to this PR in the new year.
Best of luck with running the latest fast.ai course!
Coming back to this in the new year is perfect timing. Plum should undergo some major improvements between now and then.