Steve Canny
Hi @S1M0N38 can you provide an email that demonstrates this (mis-)behavior? I'm just finishing up a broad refactor of `partition_email()` that takes a different approach overall. I expect this problem...
How about running this `email.iterators._structure()` stdlib function on the message to give us a firm view of the MIME-part hierarchy:
```python
import email
import email.iterators

MSG_FILE_PATH = "path/to/email.msg"

with open(MSG_FILE_PATH,...
```
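For anyone who wants to try the suggestion above without a file on hand, here is a minimal, self-contained sketch. It builds a small message in memory rather than reading one from disk; the `mime_structure()` helper and the sample message contents are assumptions made for illustration and are not part of the original comment. Note that `email.iterators._structure()` is an underscore-prefixed debugging aid in the stdlib, so treat it as a diagnostic tool rather than a stable API.

```python
import email.iterators
import io
from email.message import EmailMessage


def mime_structure(msg) -> str:
    """Return the indented MIME-part hierarchy of `msg` as a string.

    Hypothetical helper: `_structure()` prints to stdout by default, so we
    pass a StringIO buffer as `fp` to capture the output instead.
    """
    buf = io.StringIO()
    email.iterators._structure(msg, fp=buf)
    return buf.getvalue()


# Build an illustrative message: plain-text body, HTML alternative, one attachment.
msg = EmailMessage()
msg["Subject"] = "demo"
msg.set_content("plain body")
msg.add_alternative("<p>html body</p>", subtype="html")
msg.add_attachment(
    b"\x00\x01",
    maintype="application",
    subtype="octet-stream",
    filename="blob.bin",
)

print(mime_structure(msg))
```

Each nesting level is indented by four spaces, which makes it easy to see at a glance whether an attachment sits beside the body or is nested inside another multipart container.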
k, np. Closing for now then.
@S1M0N38 that was removed on purpose. @Paul-Cornell can you remove that parameter from the docs for us?
@S1M0N38 Ahh, okay, so that's a bug then. You shouldn't have to provide a custom partitioner for that :) Shall we make this into a bug report for that or...
@Paul-Cornell it looks like the text is about the same for `partition_email()` and `partition_msg()` on that page: --- > You can optionally partition e-mail attachments by setting process_attachments=True. If you...
Duplicate of #3102
Closing as inactive. Feel free to reopen if this is still a problem and you can provide a file that reproduces this behavior.
@ffma-nate-rogan are you seeing an actual performance bottleneck on this, or are you proposing this on principle? By my reading, `check_for_nltk_package()` is called at most once with each set of parameters,...
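As a generic illustration of the "at most once with each set of parameters" property, one common way to get it in Python is memoization with `functools.lru_cache`. The sketch below is an assumption about the general technique, not a description of how the library actually achieves it; `check_for_package()` and the `calls` list are hypothetical stand-ins.

```python
from functools import lru_cache

# Records every time the function body actually runs (illustration only).
calls = []


@lru_cache(maxsize=None)
def check_for_package(package_name: str, package_category: str) -> bool:
    """Hypothetical stand-in for a comparatively expensive availability check."""
    calls.append((package_name, package_category))
    return True


check_for_package("punkt", "tokenizers")
check_for_package("punkt", "tokenizers")  # served from the cache, body not re-run
check_for_package("averaged_perceptron_tagger", "taggers")

print(len(calls))
print(check_for_package.cache_info())
```

With a cache like this in place, repeated calls cost a dictionary lookup, which is why measuring an actual bottleneck first is a reasonable ask.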
We currently extract run text from inline text-box shapes along with the rest of the text in the paragraph to which the textbox is anchored. This behavior was added in...