binmakeswell

Results: 39 issues by binmakeswell

[Colossal-AI](https://github.com/hpcaitech/ColossalAI) is a unified deep learning system for the big model era, which integrates many efficient techniques like multi-dimensional tensor parallelism, sequence parallelism, heterogeneous memory management, etc. By using Colossal-AI,...

The PyTorch LAMB implementation you released differs from the official TensorFlow implementation released by the paper's authors. According to the official implementation published in the...

### 🐛 Describe the bug When parallel is set to pipeline=4 and tensor=dict(mode='2d', size=4), the program hangs during initialization and prints no error message. ### Environment...

bug
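For anyone trying to reproduce the hang reported above, it may help to first check how many processes the quoted setting implies. The snippet below is a minimal sketch, not a diagnosis of the bug: it assumes the launch needs at least pipeline size × tensor size processes and that `2d` mode expects a perfect-square tensor size. `required_processes` and `is_square` are hypothetical helpers written for illustration, not part of the Colossal-AI API.

```python
import math

# Parallel setting as quoted in the issue text.
parallel = dict(pipeline=4, tensor=dict(mode='2d', size=4))


def required_processes(parallel_cfg):
    """Minimum process count implied by the config (hypothetical helper)."""
    pipeline = parallel_cfg.get('pipeline', 1)
    # Assumption: the pipeline entry may be an int or a dict with a 'size' key.
    if isinstance(pipeline, dict):
        pipeline = pipeline.get('size', 1)
    tensor_size = parallel_cfg.get('tensor', {}).get('size', 1)
    return pipeline * tensor_size


def is_square(size):
    """Assumption: '2d' mode lays devices out as a q x q mesh."""
    q = math.isqrt(size)
    return q * q == size


min_processes = required_processes(parallel)
tensor_ok = is_square(parallel['tensor']['size'])
print(f"needs at least {min_processes} processes; square tensor size: {tensor_ok}")
```

Under these assumptions the quoted setting needs 16 processes, so launching with fewer is one thing worth ruling out before digging further.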

### Describe the feature e.g. Latent Diffusion

enhancement

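### Describe the feature StyleGAN-XL is a general training framework that supports StyleGAN3 and StyleGAN2-ADA, and its code has also been simplified. It would make a great example; the code was released only a few weeks ago.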

enhancement

### 📚 The doc issue A step-by-step tutorial, e.g. model.

documentation
enhancement
help wanted

### Describe the feature Updates, maintenance, and usage are not uniform, and identically named model files without descriptions may confuse users, hurting user experience and development efficiency.

enhancement

### Describe the feature https://github.com/hpcaitech/ColossalAI-Examples/issues/76

enhancement

### Describe the feature Most users do not have 64 GPUs to try these configs. Could the examples use fewer resources? https://github.com/hpcaitech/ColossalAI-Examples/tree/main/image/vision_transformer/hybrid_parallel

enhancement

### 📚 The doc issue Incomplete: Define your own parallel model

documentation