document-parsing topic

List document-parsing repositories

unstructured

8.6k
Stars
702
Forks
43
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

edenai-apis

374
Stars
53
Forks
Watchers

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

papercast

32
Stars
1
Forks
Watchers

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...

community

19
Stars
6
Forks
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

docling

16.4k
Stars
834
Forks
81
Watchers

Get your documents ready for gen AI