CERN Database and Analytics Group

Results 6 repositories owned by CERN Database and Analytics Group

dist-keras

623
Stars
170
Forks
Watchers

Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

Hadoop-Profiler

24
Stars
10
Forks
Watchers

Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.

hdfs-metadata

56
Stars
18
Forks
Watchers

Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks and nodes.

spark-dashboard

92
Stars
21
Forks
Watchers

Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.

SparkDLTrigger

29
Stars
12
Forks
Watchers

Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"

SparkPlugins

78
Stars
15
Forks
Watchers

Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems...