big-data-analytics topic

List big-data-analytics repositories

Movies-Analytics-in-Spark-and-Scala

90 stars, 52 forks

Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.

ydata-profiling

12.1k stars, 1.6k forks

One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.

metatron-discovery

433 stars, 108 forks

A powerful and easy way to do big data discovery.

aut

133 stars, 33 forks

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Graph_Sampling

156 stars, 48 forks

Graph Sampling is a Python package containing various approaches that sample the original graph according to different sample sizes.

v6.dooring.public

443 stars, 94 forks

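A large-screen data visualization dashboard solution that provides a visual editing engine, helping individuals and enterprises easily build their own custom dashboard applications.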

lithops

307 stars, 97 forks

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

v6d

809 stars, 117 forks

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

EasyML

2.0k stars, 441 forks

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real-world tasks.