big-data topic

List big-data repositories

presto-go-client

225
Stars
57
Forks
Watchers

A Presto client for the Go programming language.

talaria

198
Stars
31
Forks
Watchers

TalariaDB is a distributed, highly available, and low latency time-series database for Presto

kafka-connect-hdfs

476
Stars
396
Forks
Watchers

Kafka Connect HDFS connector

doris

11.6k
Stars
3.1k
Forks
Watchers

Apache Doris is an easy-to-use, high performance and unified analytics database.

arkime

6.2k
Stars
1.0k
Forks
Watchers

Arkime is an open source, large scale, full packet capturing, indexing, and database system.

maha

128
Stars
62
Forks
Watchers

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

eland

619
Stars
96
Forks
Watchers

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

zeppelin

6.3k
Stars
2.8k
Forks
Watchers

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

BigData-Notes

15.4k
Stars
4.2k
Forks
Watchers

大数据入门指南 :star:

spark-py-notebooks

1.6k
Stars
911
Forks
Watchers

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks