Junyang Lin comments

Results 173 comments of


                                            Junyang Lin

Cannot load model parameters from checkpoint ../../checkpoints/ofa_base.pt; please ensure that the architectures match.

Wait wait... Aren't you using `train_caption_stage1_base.sh` but instead `train_caption_stage1.sh`? I think that is because of the script. The arch of `train_caption_stage1.sh` is `ofa_large`, and thus you can't load a base...

finetuning with caption_cn_large

Try gradient accumulation with `--update-freq`

关于device_map的问题

`device_map='auto'` will automatically enables your model to run on multiple GPUs. If you would like to use only 1 GPU, you can set `device` or set the environment variable like...

Setting for API key in the UI

Being frozen is quite necessary. I may prefer that people first finish the setup first, and then run the whole task (now it seems that everything is still a single...

about Dataset

Sorry, we do not have the permission.

No such file: '/home/linjunyang/multilabel_rcv/topic_sorted.json'

Create a json file for your label set dictionary, or use the one I just uploaded.

请问能不能提供一下基准模型的代码呢

修改配置文件或者代码都可以实现

Wrong: _, term_width = os.popen('stty size', 'r').read().split() 问题

这个可能和python版本有关系，你要不吧这行注释掉，然后随便设个term width，比如80

What does the para. schesamp mean?

schesamp refers to schedule sampling and schedule refers to the schedule for learning rate decay

请问data里面为什么没有词向量呢

都是随机初始化，没有预训练