mmark WIP feat: add monadic support

This PR would add support for monadic transformation and rendering.

The main motivation is to give the possibility for transforming and rendering in a certain monad (as requested in #46 ). For example, one might want to call an external service to render a specific content.

The scan alternative (i.e. doing a scanning beforehand, then send the results to the rendering extension) is not enough as scanning only consider the original content, not the one that might get constructed from the content and the application of other extensions. Adding the possibility to scan over the transformed content would solve that problem, but leave something to be desired (intermediate information generated, having some internals hidden but the transformed content publicly visible...).

This PR would solve the problem by introducing monad support for MMark, support that is already present in Lucid.

While the code should not introduce major breaking changes, there's still some open question regarding performance and ease of use (MMarkM m might seem a little weird, just as the parse function that just doesn't care about the monad).

Is this PR worth investigating?

Dec 08 '20 18:12 michivi

Thanks! I did not forget about this PR. I'll review it when I have time, probably this weekend.

Dec 14 '20 11:12 mrkkrp

No worries. :-)

My thoughts on the PR: Though I can successfully use the monad for side-effects, it seems weird to have to mention a monad at all when using MMark (well, the newly introduced MMarkM m actually). As it is, extensions are only used for rendering.

Perhaps it would be more interesting to split MMark into two separate data structures: one for the actual blocks and YAML data, and another one for the extensions. The usage of the PR would be cleaner as no type parameter would be required for the former, but some effort will go into backward compatibility. The extensions would only be required when rendering. Scanning can use a monad which can be completely different from the one given in MMarkM.

At least, I believe it would simplify my usage of the library :-)

Dec 16 '20 08:12 michivi

I looked at this today. To be frank I'm reluctant to follow this path because it'd result in duplication of almost the entire API.

For example, one might want to call an external service to render a specific content.

But this is still possible even with the current code, right? You just need to perform all your effects before you start rendering. This way you can prepare all necessary information (e.g. fetch something via HTTP) and then use via the extension mechanism. The only use I can see for introducing a monad is for manipulation of a state during rendering, e.g. if you want to assign an integer per link.

The scan alternative (i.e. doing a scanning beforehand, then send the results to the rendering extension) is not enough as scanning only consider the original content, not the one that might get constructed from the content and the application of other extensions.

It looks like the trans/render extension-constructors could allow the user to inspect inlines constructed so far (right now we only show the original inlines inside the Ois wrapper). If this is indeed what is desired, it would be a more elegant enhancement, more in the spirit of the library. I think what I tried to do is to avoid tangling and interweaving of extensions. The way it is done is that you have the HTML rendition constructed so far and you can add something around it (or just before/after), but you only can inspect the original inline to decide what to do. If it weren't the case extensions could start interacting in a confusing and hard-to-debug way. For example, you could have an extension that transforms links. Then you could have another extension that creates a navigation form. You may or may not want to transform the navigation links in the same way you transform links that come from the original document. Right now the behavior is such that the navigation links will essentially be out of reach for other extensions and the code that adds them should decide their final appearance and properties. Granted, I see how this can turn out to be limiting for certain applications, but I think that the idea is sane.

Jan 11 '21 18:01 mrkkrp

I agree with your vision for simplicity, and that is completely in line with the package philosophy.

Just for completeness, here was my goal. For my (very specific) use case of rendering some specific blocks using external services, the usage just seemed weird.

First translate some blocks into other blocks (translate the content into the dialect of external service B using service A)
Then transform those translated blocks using the external service B

I can't just use the MMark API as is, as I need some side-effects. Thus, the steps I thought I would have to follow with scanning would then be:

Parse the markdown document
Scan the document for those specific blocks
Generate a mapping for those blocks using extension and service A
Add extension A with those mappings
Scan the document for the mapped blocks (<-- won't actually find any as scan only sees the original blocks)
Generate a mapping of those blocks into the final blocks using extension and service B
Add extension B with those final mappings
Purely render to HTML

This solution doesn't work as scanning only sees the original blocks. And I wanted to keep extensions A and B separate for modularity and reusability.

One of the points of this PR was to know if there was another way, preserving composition and reasonability. Monads would allow for a single pass, but agreed, in exchange for complexity and heavy API modifications. I concur that the PR may not be the best idea :-) But I don't know if bad interactions are possible this way, so long as they only see the currently transformed structure (just as with function composition?). Extensions would still be testable independently. And in some cases (such as here), interactions might just be what we are looking for. In all cases, I believe the user may be able to decide using the order of the extensions?

I also have to admit that this is clearly not the everyday use case. For those type of cases and to keep the API simple, wouldn't it be possible to keep the existing API as is (not introducing a constructor to access the tranformed content), but have and expose an intermediate data structure for MMark?

The process could be:

Input --> MM Parsing --> Intermediate MD --> Custom block rendering --> Intermediate MD --> MM Processing --> HTML
                               ^                      ^                                           ^
                               |                      |                                           |
                    Parsed and transformable          |                              Existing MMark processing
                           Markdown                   |
                                              External service

The upside is that the client is free to do whatever it wants with the structure before rendering, including side-effects. Or it can also use the existing API, preserving performance. The downsides are that some internal structure would be exposed, and the line with this and extensions would be kind of blurry...

Jan 12 '21 09:01 michivi

mmark mmark copied to clipboard

WIP feat: add monadic support

mmark
mmark copied to clipboard