njucckevin
njucckevin
I used the code in https://huggingface.co/OFA-Sys/ofa-large-caption to inference, it could generate good captions and everything is ok. But when I used this code to generate 5000 captions for MSCOCO test...
### Question Hi, I'm confused with the human demonstrations provided in https://github.com/stanfordnlp/miniwob-plusplus-demos. These demonstrations seem mussy, which has dozens of (eg: 20+) state contain mouse up/down and keyboard up/down in...
Hi, thanks for the good work. I wonder how the click accuracy and scroll accuracy is calculated in section 5.1. I can not find such code in main.py and action_matching.py....
``` [rank1]: Traceback (most recent call last): [rank1]: File "/home/nfs04/chengkz/VL-RLHF/src/vlrlhf/dpo.py", line 146, in [rank1]: dpo_trainer.train(resume_from_checkpoint=training_args.resume_from_checkpoint) [rank1]: File "/home/data_91_d/anaconda3/envs/chengkz_lvlm/lib/python3.10/site-packages/transformers/trainer.py", line 1885, in train [rank1]: return inner_training_loop( [rank1]: File "/home/data_91_d/anaconda3/envs/chengkz_lvlm/lib/python3.10/site-packages/transformers/trainer.py", line 2216,...