Alan May
As a temporary solution, you can [convert the GPTQ 4-bit model locally](https://github.com/qwopqwop200/GPTQ-for-LLaMa/tree/cuda#llama). I will test compatibility with other models released by TheBloke.
@VGEAREN I made a similar modification before, but it has a problem: it is not compatible with the openai python sdk, because it **sends a ping event**...
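To illustrate the incompatibility: the openai python sdk expects every `data:` line in the SSE stream to be a JSON chunk (or the literal `[DONE]`), so an extra ping event trips up its parser. Below is a minimal sketch of a raw SSE consumer that tolerates such events; the function name, URL, and payload are placeholders, not FastChat's actual API, and `requests` is used here in place of the SDK:

```python
import json
import requests  # assumption: plain HTTP client instead of the openai sdk

def stream_chat(url: str, payload: dict):
    """Consume an OpenAI-style SSE stream, skipping non-JSON ping events."""
    with requests.post(url, json=payload, stream=True) as resp:
        for raw in resp.iter_lines(decode_unicode=True):
            # Blank keep-alives and SSE comment lines (e.g. ": ping") lack a
            # "data:" prefix, so they are filtered out here.
            if not raw or not raw.startswith("data:"):
                continue
            data = raw[len("data:"):].strip()
            if data == "[DONE]":
                break
            try:
                chunk = json.loads(data)
            except json.JSONDecodeError:
                continue  # e.g. a ping payload that is not a JSON chunk
            yield chunk
```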
@merrymercy I can help with the test, since I had the same problem before. I will update with results later. --- Update: tried this PR with 4*A100 (80G); training works, but it OOMs when saving....
@merrymercy @zhisbug I tried several different settings with the FSDP API; all of them failed when saving the model. But based on [this comment](https://github.com/tatsu-lab/stanford_alpaca/issues/81#issuecomment-1494614864), I finally managed to save the model with **python3.10**...
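For anyone hitting the same save-time OOM, the workaround discussed in that thread boils down to gathering the full state dict on CPU, on rank 0 only, before writing it to disk. A rough sketch with the PyTorch FSDP API (the function and path names are placeholders, and exact API details vary across torch versions):

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp import StateDictType, FullStateDictConfig

def save_fsdp_model(model: FSDP, path: str) -> None:
    """Gather the full state dict on CPU (rank 0 only) to avoid GPU OOM at save time."""
    cfg = FullStateDictConfig(offload_to_cpu=True, rank0_only=True)
    with FSDP.state_dict_type(model, StateDictType.FULL_STATE_DICT, cfg):
        state = model.state_dict()  # materialized on CPU, populated only on rank 0
    if dist.get_rank() == 0:
        torch.save(state, path)
```

Offloading to CPU keeps the gathered weights out of GPU memory, which is what makes the difference on setups where training fits but saving does not.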
Hello, I have a similar problem. May I ask what your starting loss value is? Mine starts decreasing from 8.0.
@zhisbug Hi, I made a new PR to add GPTQ-4bit support; can you take a look and give some advice? Thanks! #1209
please🙏