FastChat icon indicating copy to clipboard operation
FastChat copied to clipboard

Deepspeed support and config file?

Open yzxyzh opened this issue 2 years ago • 1 comments

Hi:

Can this model be trained with Deepspeed support? If yes, could anyone provide a workable Deepspeed config file?

Thanks.

BTW, I have tried to use a simple config setting as below: { "zero_optimization": true } I then observed about 20% increase in training speed. Honestly speaking, it's far from what I would expected (double or even triple the training speed). I think it's because the config file is incorrect.

yzxyzh avatar Apr 15 '23 03:04 yzxyzh

The speedup is based on your gpu network topology and zero level / parallelization config. The zero optimization config is typically:

"zero_optimization": {
  "stage": $STAGE_NUMBER_YOU_WANT_FROM_0_TO_3
}

ZYHowell avatar Apr 26 '23 00:04 ZYHowell

It seems the training speed with Deepspeed isn't great. We'll add some better model-parallel training support soon. Closing this ticket.

zhisbug avatar May 08 '23 08:05 zhisbug