Results: 13 comments of alphanlp

It's very interesting: when I use softmax as proposed in the paper, the loss does not go down.
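
A common cause of a stalled loss in this situation (assuming a PyTorch training loop, which is my guess here) is applying an explicit softmax before `nn.CrossEntropyLoss`, which already applies log-softmax internally; the doubled softmax flattens the logits and gradients barely move. A minimal sketch of the pitfall:

```python
import torch
import torch.nn as nn

logits = torch.randn(4, 10, requires_grad=True)  # (batch, num_classes)
targets = torch.randint(0, 10, (4,))
criterion = nn.CrossEntropyLoss()

# Pitfall: CrossEntropyLoss expects raw logits; an extra softmax
# squashes them into [0, 1] and the loss barely decreases.
loss_stuck = criterion(torch.softmax(logits, dim=-1), targets)

# Fix: pass the raw logits directly.
loss_ok = criterion(logits, targets)
```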

> Why does QLoRA's loss decrease more slowly? I have the same question.

Has anyone solved this problem?

Is there a shared proxy anyone can offer? I don't have an overseas server.

bert.embeddings.word_embeddings.weight: found shape torch.Size([21128, 768]) in the checkpoint and torch.Size([30522, 768]) in the model instantiated
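
For reference, 21128 is the vocabulary size of `bert-base-chinese` and 30522 that of `bert-base-uncased`, so the checkpoint and the instantiated model point at different BERT variants. A minimal sketch of the two usual fixes, assuming the Hugging Face `transformers` API (the local checkpoint path below is hypothetical):

```python
from transformers import AutoConfig, AutoModel

# Fix 1: instantiate the architecture that actually matches the
# checkpoint (21128 = bert-base-chinese vocab size).
model = AutoModel.from_pretrained("bert-base-chinese")

# Fix 2: if mixing a 21128-vocab checkpoint with a 30522-vocab config
# is deliberate, skip the strict shape check; the mismatched embedding
# matrix is then freshly initialized instead of loaded.
config = AutoConfig.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained(
    "path/to/checkpoint",  # hypothetical local checkpoint path
    config=config,
    ignore_mismatched_sizes=True,
)
```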

I hit the same error when using LLaMA as the actor with ZeRO stage 3.