Erez Shinan comments

Results 495 comments of


                                            Erez Shinan

A streamable parser

Hello @swamidass and @MegaIng , I wonder if maybe the best solution is to use [mmap](https://docs.python.org/3/library/mmap.html). I imagine it's the most efficient option, and we already support `bytes` as input,...

A streamable parser

But if unicode is a must, I don't see a better approach than what MegaIng presented, which is to constantly grow your text buffer as the lexer advances. The obvious...

Implement cache for earley parser

Yes, we'll be happy to accept a PR that extends the existing interface to support Earley. I can't promise how soon I'll get to it, since I have other obligations,...

Implement cache for earley parser

I see, that makes sense. I forgot it happens internally, since that function is called for LALR anyway too. Perhaps it should be refactored out. Another thing I recently noticed...

Partial parsing

It does build the parse-tree incrementally. You can find it at `ip.parser_state.value_stack`, where `ip: InteractiveParser`. But it might be a bit tricky to use it effectively.

Partial parsing

Like you said, tokens have a type and a value. The value is the actual bit of text being parsed, and the type is the category of that text. The...

Contextual Lexer Leaking "Spam" Terminals

@davidmcnabnz Yes, the Earley parser is harder to debug. I wouldn't say impossible. The "problem" with Earley (which is also why it's so powerful) is that it doesn't know what...

Contextual Lexer Leaking "Spam" Terminals

Congrats on the milestone! Sounds interesting, what would the API look like, more or less?

Contextual Lexer Leaking "Spam" Terminals

@davidmcnabnz Hard to say without seeing it first! Maybe it's safer to put it in a separate repo, and we can always copy it to the main one afterwards.

Contextual Lexer Leaking "Spam" Terminals

@davidmcnabnz I don't think closures are the worst thing, but I agree it was nicer if there was a more relaxed way to do it. But at the same time,...