unstructured-data topic

List unstructured-data repositories

fiftyone

6.8k
Stars
510
Forks
Watchers

The open-source tool for building high-quality datasets and computer vision models

nomic

1.0k
Stars
143
Forks
8
Watchers

Interact, analyze and structure massive text, image, embedding, audio and video datasets

base

28
Stars
3
Forks
Watchers

Adansons Base is a data programming tool for error-analysis of training results. It organizes metadata of unstructured data and creates and organizes datasets. It makes dataset creation more effectiv...

unstract

4.5k
Stars
349
Forks
34
Watchers

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

unstructuredio-haystack

16
Stars
2
Forks
Watchers

💙 Unstructured Data Connectors for Haystack 2.0

pipeline-backend

15
Stars
8
Forks
Watchers

⇋ A REST/gRPC server for Instill VDP API service

radient

264
Stars
10
Forks
Watchers

Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.

llm-graph-builder

2.3k
Stars
360
Forks
20
Watchers

Neo4j graph construction from unstructured data using LLMs

extractous

47
Stars
2
Forks
Watchers

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

marly

90
Stars
7
Forks
Watchers

The Data Processor for Agents