unstructured-data topic

List unstructured-data repositories

towhee

3.0k
Stars
239
Forks
39
Watchers

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

erlexec

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

docarray

2.8k
Stars
222
Forks
Watchers

Represent, send, store and search multimodal data

bootcamp

1.7k
Stars
540
Forks
Watchers

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

awesome-document-understanding

1.2k
Stars
133
Forks
Watchers

A curated list of resources for Document Understanding (DU) topic

pygrok

274
Stars
76
Forks
Watchers

python implementation of jordansissel's grok regular expression library

instill-core

2.1k
Stars
90
Forks
Watchers

🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications

nucliadb

587
Stars
45
Forks
Watchers

NucliaDB, The AI Search database for RAG

dkm

95
Stars
6
Forks
Watchers

Dynamic Kernel Matching (DKM) for Classifying Data with Non-conforming Features

Bracmat

47
Stars
5
Forks
Watchers

Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.

relevanceai

103
Stars
19
Forks
Watchers

Home of the AI workforce - Multi-agent system, AI agents & tools