Jaemin Cho
Hi, the config file is adapted from the original config of the CLIP-RN50 transformer model (https://github.com/clip-vil/CLIP-ViL/blob/master/CLIP-ViL-Direct/caption/configs/phrase1/transformer.yml). I only changed it to use a larger batch size and fp16 for faster training. Since...
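For reference, the edits were roughly of this shape (a sketch only; the field names follow the CLIP-ViL `transformer.yml` conventions from memory and may not match the actual file exactly):

```yaml
# Sketch of the edited fields, not the full config.
batch_size: 25   # per GPU (8 V100s), larger than the original CLIP-ViL setting
fp16: true       # mixed-precision training for speed
```

Everything else was left as in the original CLIP-ViL config.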
Back then I didn't use wandb, so I don't have log files for that run, sorry.
I just remember that I actually ran the original CLIP-ViL training script to train the MLE model. Could you please run with the same batch size=10 for 25 epochs following...
For multi-GPU training, I guess you could get similar performance with fewer warmup steps, such as 1000 steps.
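To illustrate what I mean by shortening warmup (the function below is just a sketch of a standard linear warmup, not code from this repo; the 1000-step value is the suggestion above, and you would plug the equivalent logic into whatever LR scheduler your trainer uses):

```python
def warmup_lr(step: int, base_lr: float, warmup_steps: int = 1000) -> float:
    """Linearly ramp the learning rate over the first `warmup_steps` steps.

    With multiple GPUs the effective batch size is larger, so a shorter
    warmup (e.g. 1000 steps instead of a single-GPU default) may suffice.
    """
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    return base_lr
```

For example, halfway through a 1000-step warmup the LR is half of `base_lr`, and after warmup it stays at `base_lr`.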
Here I attach the output.log for the CIDEr run. I used the same configuration (8 V100s, batch size 25 per GPU) as the current config file. [cider_output.log](https://github.com/j-min/CLIP-Caption-Reward/files/9466584/cider_output.log)
It looks like the METEOR evaluation is not properly set up in the [language_evaluation package](https://github.com/bckim92/language-evaluation). Have you run `python -c "import language_evaluation; language_evaluation.download('coco')"` as mentioned in [README #Setup](https://github.com/j-min/VL-T5/blob/main/README.md#setup)?
Just found a bug in the data preprocessing file and fixed it: https://github.com/ctr4si/A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling/commit/e0d70c5b19ec724dd404ca3c94335417a4722068 Please check if it works now.
Could you also upload the logs of 1) the last 10 epochs of the above training and 2) the new 12-epoch training?