DeepSpeedExamples
Example models using DeepSpeed
It's a great tool for showing how to build a ChatGPT-like model on top of various foundation models. I wonder whether it could support encoder-decoder models, like Flan-T5? Could we just directly...
I want to fine-tune bloom_1.1b on a Chinese dataset and run run_chinese.sh. But where is the **ds_config.json** file that run_chinese.sh references?
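For context, DeepSpeed launch scripts typically point at a JSON config file. A minimal sketch of what such a file might contain, written out from Python; the field values are illustrative assumptions, not the actual config that run_chinese.sh expects:

```python
import json

# Minimal illustrative DeepSpeed config; these values are assumptions,
# not the ds_config.json that run_chinese.sh actually references.
ds_config = {
    "train_batch_size": 16,
    "gradient_accumulation_steps": 1,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
}

with open("ds_config.json", "w") as f:
    json.dump(ds_config, f, indent=2)
```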
`mtimes` is multiplied by 1000 in `print_latency` to convert the time to milliseconds, but it is already in milliseconds. `use_cuda_events` is true by default...
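A minimal sketch of the double-conversion pattern this issue describes; the function and variable names mirror the issue text and are assumptions, not the exact DeepSpeed source:

```python
import time

def timed_op():
    # Simulate measuring an operation; the returned value is already in ms.
    start = time.perf_counter()
    time.sleep(0.01)
    return (time.perf_counter() - start) * 1000  # ms

def print_latency(mtimes):
    # Bug pattern from the issue: mtimes already holds milliseconds,
    # so multiplying by 1000 again reports microseconds labeled as ms.
    avg = sum(mtimes) / len(mtimes)
    print(f"latency: {avg * 1000:.3f} ms")  # wrong: double conversion
    print(f"latency: {avg:.3f} ms")         # correct: already in ms

print_latency([timed_op() for _ in range(5)])
```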
Dear all, when we use our customized GPT model for step 3 training, we get a kernel execution error (m: 5120, n: 8, k: 1706, error: 14); however, once we...
When using DeepSpeed multi-node distributed training and loading the opt-1.3b model, I get the error: a leaf Variable that requires grad is being used in an in-place operation.
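For reference, a minimal PyTorch reproduction of that error message, independent of the multi-node setup; it just shows what triggers it and a common workaround:

```python
import torch

w = torch.ones(3, requires_grad=True)  # a leaf tensor that requires grad

try:
    w += 1  # in-place update of a leaf variable -> RuntimeError
except RuntimeError as e:
    # "a leaf Variable that requires grad is being used in an in-place operation."
    print(e)

# Common workaround: perform the in-place update without gradient tracking.
with torch.no_grad():
    w += 1
```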
Hi~ While running step 2 reward model training, I got a strange result after one epoch of training: ***** Evaluating reward, Epoch 1/1 ***** chosen_last_scores (higher is better): -9.388486862182617, acc (higher...
In [applications/DeepSpeed-Chat/training/step2_reward_model_finetuning/main.py](url), lines 254-258:

```
scores += outputs["chosen_mean_scores"].mean().float()
if step == 99:  # For faster evaluation and debugging
    break
acc = correct_predictions / total_predictions
scores = scores / (step...
```
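The quoted snippet is truncated, so the intended divisor is unclear; a minimal self-contained sketch of the averaging pattern involved, counting the batches actually evaluated before the early exit (the dummy per-batch scores and names are assumptions, not the repo's code):

```python
# Stand-in for per-batch chosen_mean_scores from an eval dataloader.
batches = [0.5, 0.7, 0.6, 0.8]

scores = 0.0
steps_run = 0
for step, chosen_mean_score in enumerate(batches):
    scores += chosen_mean_score
    steps_run += 1
    if step == 2:  # early exit for faster evaluation, as in the issue
        break

# Average over the batches actually seen, not the full dataloader length.
scores = scores / steps_run
print(scores)  # ~0.6
```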
Dear all, we are trying to reproduce the results; however, as we follow the training steps, our chatbot keeps repeating nonsense. We suspect that our RLHF part is...
I found a small typo in some documentation and code: `--only_optimizer_lora` should be `--only_optimize_lora`. I have fixed all the affected files. Thank you.