Vik Paruchuri

Results 136 comments of Vik Paruchuri

@tungsten106 Thanks for much for this! It was on my list of functionality to add soon. I'll take a look next week (after the holiday).

@tungsten106 I'd love to review this, but the diffs seem to have issues (entire file is shown as deleted, with all the lines also shown as added). I'm having a...

It will skip the images currently. It's possible to save the images separately and embed them into the markdown, though - what's your usecase for the images?

It will extract tables, just not images. Can you tell me more about what you're using the images for?

If anyone wants to contribute this with a PR, I'd be very excited to review. I'm working on improving some of the base models and making marker fully open (it's...

I'm training a model to extract images - this will be integrated into marker

Image extraction will be coming in the next version (should be shipped in the next 2 weeks).

Just added this into the dev branch - https://github.com/VikParuchuri/marker/pull/111 . I'm going to close this issue, since the feature will land in master soon (next few days).

This wouldn't be hard to add. You can look at the `max_pages` flag as an example. Basically it's passed into [this function](https://github.com/VikParuchuri/marker/blob/master/marker/extract_text.py#L102) to restrict the page count. You could instead...

Thanks for the test case. I'll look into this. The ordering model should figure out the column count, but it may be misclassifying these pages.