Steve Canny
@sidatcd can you provide me a fresh stack-trace? I can't make any sense of the one earlier in the thread, possibly because of its age. Also, do you have reason...
Okay, looks like PPMs are coming from `pdftoppm` (part of `poppler`) as part of the process, so that explains the `ppm` bit anyway.
@sidatcd Unfortunately I am unable to reproduce this with the PDF earlier in the thread. I'll have to close it for now because it's not actionable. If you are able...
@sidatcd Regarding the images, that threw me at first too. But what's happening in this step is the entire PDF document is being _rendered_ to a series of "page" images...
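For anyone following along, the rendering step described above can be sketched roughly as below. This is only an illustration of calling `pdftoppm` directly, not the code the library actually runs. The helper names (`build_pdftoppm_command`, `render_pdf_pages`), the `page` output prefix, and the 200 DPI default are all assumptions made up for the example.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftoppm_command(pdf_path, output_prefix, dpi=200):
    """Return the argv list for rendering every page of a PDF to PPM images.

    Hypothetical helper; the 200 DPI default is an assumption.
    """
    # pdftoppm usage: pdftoppm [options] PDF-file PPM-root
    return ["pdftoppm", "-r", str(dpi), str(pdf_path), str(output_prefix)]


def render_pdf_pages(pdf_path, out_dir, dpi=200):
    """Render each page to a PPM file in out_dir and return the sorted paths."""
    if shutil.which("pdftoppm") is None:
        raise RuntimeError("pdftoppm not found; install poppler first")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    cmd = build_pdftoppm_command(pdf_path, out_dir / "page", dpi)
    subprocess.run(cmd, check=True)
    # pdftoppm writes one image per page, named <prefix>-<page number>.ppm
    return sorted(out_dir.glob("page-*.ppm"))
```

Calling `render_pdf_pages("doc.pdf", "pages")` should leave one `.ppm` file per page in `pages/`, which is why PPM images show up even when the PDF itself contains none.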
Closing as inactive.
Closing as inactive; assumed resolved since it cannot be reproduced.
Closing this as not actionable. @02deno there is nothing to be done about this without developing a more sophisticated re-rendering procedure. One avenue you might explore is opening and saving...
@rileym99 the general gist is: - figure out if PowerPoint can do it and then how it does it. This involves using the PowerPoint UI to do it and...
@CaptKludge note that when you do that, any shapes on the slide layout become "background" shapes on a slide created from that layout and cannot be accessed or of course...
@MthwRobinson Turns out this behavior is produced by the `markdown` package, which is what `partition_md()` uses to convert Markdown to HTML. ```python md_text = """ My list - item 1...