Hao Zhang
Hao Zhang
Closing this PR, I am going to start a new PR with the fix.
what do you mean by stream feature? aren't our current CLI and web interface both streaming?
I am not sure we can directly use the method `trainer.deepspeed`? since our train.py script does not use any deepspeed functionality
@samarthsarin Please provide more information (full stack trace); it is hard to help by only seeing an assertion error.
you are running out of memory.
@lw3259111 Just try to get a clarification -- are you training your own llama 33B using deepspeed?
Do you mean the openAI-like API we added recently or something else?
Btw, we do not hold a public API service.
@wujunjiesd I think so, see this PR: https://github.com/lm-sys/FastChat/pull/663