document-parsing topic

List document-parsing repositories

unstructured

7.0k
Stars
528
Forks
43
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

edenai-apis

374
Stars
53
Forks
Watchers

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

papercast

32
Stars
1
Forks
Watchers

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...

community

19
Stars
6
Forks
Watchers

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.