Kyle Bi comments

Results 10 comments of


                                            Kyle Bi

trafficstars

[Help] <title>p-tuning的prefix加在transformer哪些层？

> > > > 看源码，实现在所有层了。源码位置在modeling_chatglm.py 136行，和ptuning v2，代码一致。 ![企业微信截图_16844866285937](https://user-images.githubusercontent.com/48375360/239488643-819154ad-ee3e-4b71-899e-5a916e847cf8.png) 您好，多问一句，PrefixEncoder的forward函数的输入是什么？我看论文中的图应该指的是pre_seq_len的tokens作为输入？

[FEATURE]: ChatGLM model support

![image](https://github.com/hpcaitech/ColossalAI/assets/28759055/f6e7e84e-8c51-433c-807c-b33033f43af9) 哈喽楼主，问下在运行train_prompts.sh的时候，出现因为actor_optim和critic_optim中可优化的参数为空，导致zero的策略中flatten的操作报错的情况，因为flatten的参数不能为空。我在train_peft_prompts.py文件里把optimizer的参数都设置requires_grad=True也不行。

Kyle Bi

[Help] <title>p-tuning的prefix加在transformer哪些层？

[FEATURE]: ChatGLM model support

[FEATURE]: ChatGLM model support

[🌟Tutorials🌟] Use MiniGPT-4 in Google Colab or your computer | 在本地或者Colab上使用MiniGPT-4

[🌟Tutorials🌟] Use MiniGPT-4 in Google Colab or your computer | 在本地或者Colab上使用MiniGPT-4

有人成功了吗？

有人成功了吗？

关于训练过程一个batch内，text文本标签存在较多重复，是否会导致训练不收敛

[Feature]: Summarize the elapsed time of PyTorch ops in a training job.

请问文中提出的包含AVA, FLICKR- AES和TAD66的测试集可以提供吗？