parameter-server topic
LightCTR
Lightweight and scalable framework that combines mainstream Click-Through-Rate prediction algorithms based on a computational DAG, the philosophy of Parameter Server, and Ring-AllReduce collective communicat...
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
XLearning-XDML
Extremely distributed machine learning
ps
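A self-implemented deep learning training framework, written in pure Java with few third-party dependencies, that supports distributed training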
GeoMX
GeoMX: A fast and unified system for distributed machine learning over geo-distributed data centers.
cirrus
Serverless ML Framework
OpenEmbedding
OpenEmbedding is an open-source framework for TensorFlow distributed training acceleration.
veloce
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
PetPS
PetPS: Supporting Huge Embedding Models with Tiered Memory
AdaPM
A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training