nccl topic

List nccl repositories

bluefog

290
Stars
71
Forks
Watchers

Distributed and decentralized training framework for PyTorch over graph

cupy

9.0k
Stars
829
Forks
127
Watchers

NumPy & SciPy for GPU

nccl-fastsocket

102
Stars
13
Forks
Watchers

NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.

tutorial-multi-gpu

167
Stars
50
Forks
Watchers

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

large_language_model_training_playbook

441
Stars
18
Forks
Watchers

An open collection of implementation tips, tricks and resources for training large language models

llm_training_handbook

412
Stars
27
Forks
Watchers

An open collection of methodologies to help with successful training of large language models.

msrflute

180
Stars
22
Forks
Watchers

Federated Learning Utilities and Tools for Experimentation

pyDNMFk

20
Stars
6
Forks
Watchers

Python Distributed Non Negative Matrix Factorization with custom clustering

NCCL.jl

20
Stars
9
Forks
Watchers

A Julia wrapper for the NVIDIA Collective Communications Library.

NCCL

21
Stars
7
Forks
Watchers

Sample examples of how to call collective operation functions on multi-GPU environments. A simple example of using broadcast, reduce, allGather, reduceScatter and sendRecv operations.