DeepSpeedExamples
Example models using DeepSpeed
Does DeepSpeed support fine-tuning an extra model with LoRA?
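DeepSpeed-Chat's training scripts do expose LoRA options (e.g. a `--lora_dim` flag), though whether they cover an arbitrary extra model depends on the script. As a framework-free illustration of what LoRA itself adds — a frozen weight plus a trainable low-rank update — here is a minimal pure-Python sketch (the class and all names are ours, not part of DeepSpeed):

```python
def matvec(W, x):
    """Plain matrix-vector product over nested lists."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

class LoRALinear:
    """y = W x + (alpha/r) * B(A x), with W frozen and only A, B trainable."""

    def __init__(self, W, r=2, alpha=4):
        d_out, d_in = len(W), len(W[0])
        self.W = W                                    # frozen base weight
        self.A = [[0.01] * d_in for _ in range(r)]    # r x d_in, small init
        self.B = [[0.0] * r for _ in range(d_out)]    # d_out x r, zero init
        self.scale = alpha / r

    def forward(self, x):
        base = matvec(self.W, x)
        delta = matvec(self.B, matvec(self.A, x))     # low-rank correction
        return [b + self.scale * d for b, d in zip(base, delta)]
```

Because `B` is initialized to zero, the layer reproduces the frozen model exactly at the start of fine-tuning; only the small `A`/`B` matrices receive gradients, which is what makes LoRA cheap to train alongside a large base model.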
Thanks for the great work! When I run my inference code below using `deepspeed --include localhost:0,1,2 inference.py --model opt-iml-30b --dataset WQSP`, I get the error **exits with return code = -9**...
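Return code -9 means the worker was killed with SIGKILL, which on Linux is most often the kernel OOM killer reclaiming host memory while the checkpoint is loaded. A back-of-the-envelope sketch (the helper below is illustrative, not part of DeepSpeed) shows why a 30B-parameter model is tight even before activations or optimizer state:

```python
def model_memory_gb(n_params, bytes_per_param=2):
    """Rough memory for the weights alone, in GiB (fp16 = 2 bytes/param)."""
    return n_params * bytes_per_param / 1024**3

# OPT-IML-30B weights in fp16: ~30e9 params * 2 bytes
weights = model_memory_gb(30e9)
print(round(weights, 1))  # ≈ 55.9 GiB
```

Roughly 56 GiB for the weights alone must transit host RAM during loading (more if the checkpoint is fp32), so a machine with limited CPU memory can be OOM-killed before the GPUs are even touched.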
Settings: actor & critic: OPT-1.3b; reward model: OPT-350m; GPUs: 4 × V100 32 GB. Running script: `ACTOR_MODEL_PATH=$1 CRITIC_MODEL_PATH=$2 ACTOR_ZERO_STAGE=$3 CRITIC_ZERO_STAGE=$4 OUTPUT=$5 if [ "$OUTPUT" == "" ]; then OUTPUT=./output`...
Hi, I am trying to test the attention computation on the CPU with ZeRO-Inference. I use the following command to run the script: `BSZ=96 LOG_DIR=$BASE_LOG_DIR/${MODEL_NAME}_bs${BSZ} mkdir -p $LOG_DIR deepspeed`...
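For reference, ZeRO-Inference-style CPU offload is driven by the DeepSpeed config: ZeRO stage-3 partitioning plus `offload_param` pointed at the host, so weights are streamed to the GPU per layer. A representative fragment, written here as a Python dict (the key names follow DeepSpeed's JSON schema; the specific values are illustrative, not taken from the post):

```python
# Representative ZeRO-Inference style configuration. Keys mirror DeepSpeed's
# config JSON; batch size and pinning choices here are illustrative.
ds_config = {
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,                      # stage 3 partitions the parameters
        "offload_param": {
            "device": "cpu",             # keep weights in host memory
            "pin_memory": True,          # pinned buffers speed H2D copies
        },
    },
    "train_micro_batch_size_per_gpu": 1,
}
```

With `offload_param.device` set to `"cpu"`, only the currently needed parameter shards reside on the GPU, which is what makes very large models runnable on small-memory devices at the cost of PCIe transfer time.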
I followed the instructions on this [page](https://github.com/microsoft/DeepSpeedExamples/tree/master/applications/DeepSpeed-Chat/training/step2_reward_model_finetuning) to do step2_reward_model_finetuning with the demo code. On Google Cloud Platform, I created one instance with a single V100 (16 GB) and another instance with...
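Step 2 trains the reward model with a pairwise ranking objective on (chosen, rejected) response pairs: the model should score the chosen response higher. A minimal sketch of that loss (a standalone illustrative function, not the repo's implementation):

```python
import math

def pairwise_reward_loss(r_chosen, r_rejected):
    """-log(sigmoid(r_chosen - r_rejected)): shrinks as the chosen
    response's reward pulls ahead of the rejected one's."""
    gap = r_chosen - r_rejected
    return -math.log(1.0 / (1.0 + math.exp(-gap)))

# A wider reward gap in favor of the chosen response gives a smaller loss.
print(pairwise_reward_loss(2.0, 0.0) < pairwise_reward_loss(0.5, 0.0))  # True
```

At a gap of zero the loss is exactly log 2, so early in training the loss hovering near ~0.69 is a common sanity check that the model has not yet learned to separate the pairs.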
**Is your feature request related to a problem? Please describe.** We find that the generation stage of the RLHF pipeline is time-consuming in the current training process. This is because the...
I'm trying to use the DeepSpeed-Chat stage-2 scripts to do RLHF with the Qwen-1.8B-chat model. I changed some parts in dschat and main.py to load my model; the main difference is: ...