big-data topic
logging-flume
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data
accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
awesome-AI-kubernetes
:snowflake: :whale: Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Ku...
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
smooks
Extensible data integration Java framework for building XML and non-XML fragment-based applications
hazelcast-nodejs-client
Hazelcast Node.js Client