hadoop topic

List hadoop repositories

kafka-connect-fs

110
Stars
79
Forks

Kafka Connect FileSystem Connector

data-science-ipython-notebooks

26.5k
Stars
7.7k
Forks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...

Movies-Analytics-in-Spark-and-Scala

87
Stars
51
Forks

Data cleaning, preprocessing, and analytics on a million movies using Spark and Scala.

big_data_architect_skills

457
Stars
170
Forks

Skills a big data architect should master

pyhdfs

88
Stars
22
Forks

Python HDFS client

APIJSON

16.7k
Stars
2.1k
Forks

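🏆 A zero-code, full-featured, highly secure ORM library 🚀 Backend APIs and docs need no code; the frontend (client) customizes the data and structure of the returned JSON. 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any co...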

caelus

336
Stars
81
Forks

Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs

interview-questions-collection

21
Stars
9
Forks

Interview questions organized by knowledge area, including C++, Java, Hadoop, machine learning, and more

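Course project for "Big Data Mining Techniques" at Fudan University. It attempts to find frequent 2-itemsets of keywords with high support in the search records of the Sogou Labs user query log data (2008). On the implementation side, I built a mini Hadoop cluster of five servers and used Python to...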