distributed-training topic

List distributed-training repositories

Paddle

21.8k Stars · 5.5k Forks · 718 Watchers

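PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle, 『飞桨』: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)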

HyperGBM

324 Stars · 45 Forks

A full pipeline AutoML tool for tabular data

HyperPose

1.2k Stars · 275 Forks

Library for Fast and Flexible Human Pose Estimation

byteps

1.3k Stars · 62 Forks

A high performance and generic framework for distributed DNN training

stylable

1.3k Stars · 62 Forks

Stylable - CSS for components

hivemind

1.8k Stars · 139 Forks

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

DeepRec

980 Stars · 340 Forks

DeepRec is a high-performance deep learning framework for recommendation, based on TensorFlow. It is hosted as an incubation project at the LF AI & Data Foundation.

Fengshenbang-LM

4.0k Stars · 374 Forks

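Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.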

pytorch-image-models

31.9k Stars · 4.7k Forks · 291 Watchers

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...

FedML

4.1k Stars · 772 Forks · 79 Watchers

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on a...