
Add markitdown

Open dadoming opened this issue 1 year ago • 0 comments

What is this Python project?

This is a utility for converting various file types to Markdown (e.g., for indexing or text analysis). At the moment it supports: PDF, PowerPoint, Word, Excel, images (EXIF metadata and OCR), audio (EXIF metadata and speech transcription), HTML, text-based formats (CSV, JSON, XML), and ZIP files (iterating over their contents).

What's the difference between this Python project and similar ones?

Most of the comparable packages I found convert from one specific format, such as PDF or Excel, into Markdown. This one handles many formats in a single package. Microsoft recently open-sourced the repository, so it should see ongoing maintenance and growth.
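
To make the "many formats from one package" point concrete, here is a rough stdlib-only sketch of the general idea: one entry point that picks a converter by file extension. This is not markitdown's code or API. The names `to_markdown`, `CONVERTERS`, `csv_to_markdown`, and `json_to_markdown` are made up for illustration, and it only covers two of the simpler text formats.

```python
import csv
import io
import json
from pathlib import Path


def csv_to_markdown(text: str) -> str:
    """Render CSV text as a Markdown table, treating the first row as the header."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return ""
    header, body = rows[0], rows[1:]
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join("---" for _ in header) + " |",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


def json_to_markdown(text: str) -> str:
    """Render a top-level JSON object as a Markdown bullet list."""
    data = json.loads(text)
    if not isinstance(data, dict):
        raise ValueError("this sketch only handles a top-level JSON object")
    return "\n".join(f"- **{key}**: {value}" for key, value in data.items())


# Hypothetical registry: file extension -> converter function.
CONVERTERS = {
    ".csv": csv_to_markdown,
    ".json": json_to_markdown,
}


def to_markdown(filename: str, text: str) -> str:
    """Pick a converter from the file extension and return Markdown text."""
    suffix = Path(filename).suffix.lower()
    try:
        converter = CONVERTERS[suffix]
    except KeyError:
        raise ValueError(f"unsupported format: {suffix!r}") from None
    return converter(text)


if __name__ == "__main__":
    print(to_markdown("people.csv", "name,age\nAda,36\n"))
```

Supporting another format in a layout like this means registering one more function in `CONVERTERS`, which is roughly the convenience a single multi-format package offers over installing one converter per file type.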

--

Anyone who agrees with this pull request could submit an Approve review to it.

dadoming avatar Dec 25 '24 19:12 dadoming