feat(epub): Add EPUB support
Addresses https://github.com/microsoft/markitdown/issues/88.
Adds new converter + new test.
@0xRaduan love this PR. We already have a dependency for HTML to text (markdownify) in the HTML convertor. Can you check if that would be sufficient?
Hey @gagb, sorry was on a long vacation, going to take a look right now...
Thanks for this, really looking forward to it!
Can we have an update on this? Thanks
cc. @gagb - do you think we can merge this?
i resolved all the merge conflicts as far as i can see
or also cc. @afourney, since I see you've been merging the latest PRs into main
@gagb - does this still await my response? any timeline for getting this merged?
@0xRaduan Apologies for the delay. We're a super small team, with several large projects (e.g., AutoGen). I'll work on getting this in, and conflicts resolved, this weekend.
Ok, on second glance, EbookLib is AGPL -- which is very strong copyleft. I'm not clear we can include it here. I can look for an alternative, or I can help you set it up as a 3rd party plugin that you can host. LMK
@0xRaduan I adapted this PR to not use ebooklib (as per above discussion). Admittedly, rather vibe-coded.
Please have a look at #1131 and let me know if it suits your need.
Closed in #1131
Thanks, closing this PR.
- Symferopolskaya 2L, Днепр, 49005, Украина