Puyuan Liu

Results 24 comments of Puyuan Liu

I got the same error with NousResearch/Nous-Capybara-34B:

```
File "/home/ec2-user/SageMaker/anaconda3/envs/ot-gpt-package/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 566, in from_pretrained
    return model_class.from_pretrained(
File "/home/ec2-user/SageMaker/anaconda3/envs/ot-gpt-package/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3480, in from_pretrained
    ) = cls._load_pretrained_model(
File "/home/ec2-user/SageMaker/anaconda3/envs/ot-gpt-package/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3870, in...
```

I got the same error

The script worked for OPT models but does not work for other models. I suspect it has something to do with the model's module naming.

I found the solution. Basically, you have to change `--lora_module_name decoder.layers.` to the appropriate module name for your model; for example, `--lora_module_name h.` for BLOOM and GPT-Neo.
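For context, scripts like this typically apply LoRA to every layer whose qualified name contains the `--lora_module_name` substring, which is why the value has to match your model's naming scheme. A minimal sketch of that selection logic (the module names below are illustrative, not taken from any specific checkpoint):

```python
def select_lora_modules(module_names, lora_module_name):
    """Return the module names that would receive LoRA adapters,
    assuming a simple substring match on the qualified name."""
    return [name for name in module_names if lora_module_name in name]

# OPT-style qualified names contain "decoder.layers." ...
opt_names = [
    "model.decoder.layers.0.self_attn.q_proj",
    "model.decoder.layers.0.fc1",
]
# ...while BLOOM/GPT-Neo-style names use "h." for the block list.
bloom_names = [
    "transformer.h.0.self_attention.query_key_value",
    "transformer.h.0.mlp.dense_h_to_4h",
]

print(select_lora_modules(opt_names, "decoder.layers."))  # matches both OPT layers
print(select_lora_modules(bloom_names, "decoder.layers."))  # [] -> LoRA silently applied to nothing
print(select_lora_modules(bloom_names, "h."))  # matches both BLOOM layers
```

With the wrong substring, nothing matches, so training proceeds without any LoRA parameters, which is easy to miss.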

I get this error sometimes. It disappears once I restart the server or slightly change a training parameter (e.g., change max_length from 1024 to 1025).

@mrwyattii Thanks for the reply! I was using `nvidia-smi` to measure the memory cost. I was able to train pythia-2.8B with max_length=1280 using stage=1, but got an OOM error with stage=2.

> Hello, do you use local mode or server mode?
>
> Could you show your collection info?

Thank you for your reply! It's running in local mode. The...

I am trying to insert 20k batches, with 64 embeddings per batch. The speed drops from 0.7 seconds per batch to 3 seconds per batch after 500 iterations. This only happens...

Thank you. I observed that retrieval from the sparse collection was also much (~20x) slower than from the dense one in this case.

It appears that the latency during upsert operations is due to resizing. This resizing process is initiated when we keep adding sparse vectors to a collection that contains both sparse...
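One way to confirm a resize-driven slowdown like this is to time each upsert batch and look for periodic latency spikes. The sketch below uses a toy in-memory stand-in for the collection (the `GrowableIndex` class, its doubling growth factor, and the simulated resize cost are all assumptions, not the real store's internals):

```python
import time

class GrowableIndex:
    """Toy stand-in for a collection that reallocates as it grows.
    Real stores amortize this, but each reallocation still shows up
    as a latency spike on the batch that triggers it."""
    def __init__(self):
        self.capacity = 1024
        self.size = 0

    def upsert(self, n):
        self.size += n
        resized = False
        while self.size > self.capacity:
            self.capacity *= 2  # simulated reallocation
            resized = True
        return resized

index = GrowableIndex()
latencies = []
for batch in range(200):
    start = time.perf_counter()
    resized = index.upsert(64)  # 64 embeddings per batch, as above
    if resized:
        time.sleep(0.001)  # pretend the reallocation costs extra time
    latencies.append(time.perf_counter() - start)

# Batches whose latency stands out are the ones that triggered a resize.
spikes = [i for i, t in enumerate(latencies) if t > 5e-4]
print(f"batches with resize spikes: {spikes}")
```

With a doubling capacity, the spikes land at geometrically spaced batches; a real collection that resizes on every sparse-vector insert past some threshold would instead show a sustained per-batch slowdown like the 0.7s-to-3s jump described above.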