Jiawei Zhao

Results 8 issues of Jiawei Zhao

请问为什么不用transformer的decoder部分呢?

Hi weijun, congratulations! I'm really interested in your code and can't wait to try it out. So when are you going to release the code? Thank you! The paper was...

Hi, what is the effect of the following operations? Why not just take the mean value of each class vector, like > out2=out2.mean(-1) ![image](https://user-images.githubusercontent.com/16524582/106453322-13979180-64c4-11eb-973f-d4caf9450371.png) Thanks!

Great work! Could you share the visualization scripts for the following pic? Thanks! ![image](https://user-images.githubusercontent.com/16524582/234526644-ea87106a-94e9-4c65-81ab-a30fdb3c50c2.png)

Great work! As video conversations in the instruction dataset have only one round in this version, if I want to train and test multiple rounds of video conversions, what should...

install DCNv3 with DCNv3-1.0+cu113torch1.12.1-cp38-cp38-linux_x86_64.whl my torch version: '1.12.1+cu113' import DCNv3 meets the following errors: ImportError: libc10.so: cannot open shared object file: No such file or directory ![image](https://github.com/OpenGVLab/InternImage/assets/16524582/0628f563-7a70-4e74-a64f-830479ac015e) Looking for help!...

按自定义数据格式,训练DPO在Map时报错 File "ms-swift/swift/trainers/dpo_trainer.py", line 114, in tokenize_row if len(answer_tokens['prompt_input_ids']) + longer_response_length > self.max_length: KeyError: 'prompt_input_ids' 打印了下answer的key:dict_keys(['input_ids', 'attention_mask', 'prompt_inputs_embeds', 'prompt_attention_mask']) 训练代码: CUDA_VISIBLE_DEVICES=2 \ swift rlhf \ --rlhf_type dpo \ --model_type internvl2-4b...

bug

比如InternVL中的dynamic_image_size和max_dynamic_patch?