big-data topic
presto-go-client
A Presto client for the Go programming language.
talaria
TalariaDB is a distributed, highly available, low-latency time-series database for Presto
kafka-connect-hdfs
Kafka Connect HDFS connector
doris
Apache Doris is an easy-to-use, high-performance, unified analytics database.
arkime
Arkime is an open-source, large-scale system for full packet capture, indexing, and storage.
maha
A framework for rapid reporting API development, with out-of-the-box support for high-cardinality dimension lookups with Druid.
eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
spark-py-notebooks
Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks