bigdata topic

List bigdata repositories

big-data-rosetta-code

287
Stars
33
Forks
Watchers

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

kotlin-spark-api

445
Stars
34
Forks
Watchers

This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x

shifu

252
Stars
108
Forks
Watchers

An end-to-end machine learning and data mining framework on Hadoop

liteflow

161
Stars
61
Forks
Watchers

liteflow是一个基于任务版本来实现的分布式任务流调度系统

fpart

217
Stars
37
Forks
Watchers

Sort files and pack them into partitions

spark-r-notebooks

119
Stars
71
Forks
Watchers

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

amoro

719
Stars
251
Forks
Watchers

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

flink-notes

374
Stars
128
Forks
Watchers

flink学习笔记

WeDataSphere

639
Stars
157
Forks
Watchers

WeDataSphere is a financial grade, one-stop big data platform suite.