big-data topic

List big-data repositories

redislite

569 stars, 73 forks

Redis in a Python module.

logging-flume

2.5k stars, 1.6k forks

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

accelerator

150 stars, 28 forks

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

awesome-AI-kubernetes

116 stars, 42 forks

❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Ku...

orc

661 stars, 469 forks

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

eel-sdk

144 stars, 35 forks

Big Data Toolkit for the JVM

orc

128 stars, 44 forks

An ORC file format reader and writer for Go.

smooks

386 stars, 355 forks

Extensible data integration Java framework for building XML and non-XML fragment-based applications

crunch

104 stars, 85 forks

Mirror of Apache Crunch (Incubating)