DeepSpeedExamples issues

Results 274 DeepSpeedExamples issues

Sort by recently updated

My deepspeed code is very slow

```2 pytorch allocator cache flushes since last step. this happens when there is high memory pressure and is detrimental to performance. if this is happening frequently consider adjusting settings to...

zhaowei-wang-nlp

Sd example opt

Applies optimization to the SD example: - Adds optimized_iteration flag. This flag determines which portion of the iterations to be optimized. For instance optimized_iteration = 0 means no optimization and...

PareesaMS

[chat] generate process is not a single step in RL

Hi I am from the ColossalAI team. I found that there are similarities between DeepSpeedChat and ColossalChat. We found that there might be some implementation error in our code, thus...

ht-zhou

question

deespeed chat

Request: Support for T5 models

Hi, do you plan on adding support for T5, UL2 models? Thanks!

TejaGollapudi

ValueError: optimizer got an empty parameter list

An error occurred when running pipeline_parallelism ValueError: optimizer got an empty parameter list

Deemo-cqs

The model size does not change

When I follow this [https://www.deepspeed.ai/tutorials/model-compression/#2-tutorial-for-zeroquant-efficient-and-affordable-post-training-quantization](url) run the zero_quant.sh or (quant_activation.sh and quant_weight.sh), the model size still is 418mb as the bert-base. ![image](https://user-images.githubusercontent.com/49281157/222089617-33fcb1cc-9aee-419a-8a02-9c366179d653.png) the clean_model weight still save as float32? Can...

Twilighter9527

XTC in DeepSpeed Compression does not work

Hi all, Thanks for great works. I ran some experiments with Deepspeed compression using configs in model_compression/bert. I got some issues: - Size of output model when using DeepSpeedExamples/model_compression/bert/bash_script/XTC/quant_1bit.sh config...

Toan-Do