hadoop-hdfs topic

List hadoop-hdfs repositories

cubefs

4.4k
Stars
634
Forks
Watchers

cloud-native distributed storage

seaweedfs

21.4k
Stars
2.2k
Forks
536
Watchers

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC activ...

MorphL-Community-Edition

260
Stars
36
Forks
Watchers

MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc....

dynamometer

129
Stars
36
Forks
Watchers

A tool for scale and performance testing of HDFS with a specific focus on the NameNode.

data-engineering-interview-questions

894
Stars
329
Forks
Watchers

More than 2000+ Data engineer interview questions.

sparksql-for-hbase

68
Stars
27
Forks
Watchers

Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers

datapipelines-essentials-python

53
Stars
35
Forks
Watchers

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformati...

Big_DataHadoop_Projects

49
Stars
37
Forks
Watchers

Big data projects implemented by Maniram yadav

console

22
Stars
4
Forks
Watchers

Open source data infrastructure platform. Designed for developers, built for speed.

TravelWebsite_BigDataAnalysis

20
Stars
1
Forks
Watchers

旅游网站(携程网部分数据)大数据分析-hadoop课程设计(本科课设级别)