MaoSong2022

Results 3 issues of MaoSong2022

CharXiv is a visual question answering (VQA) benchmark focused on the chart domain. It consists of two splits (val, test) and two modes (descriptive and reasoning). This PR completes...

WIP

Fix argument error in the MMMU_Pro benchmark; closes #929

[OpenAI GPT-4.1](https://openai.com/index/gpt-4-1/) uses the [CharXiv](https://charxiv.github.io/) benchmark to demonstrate its vision capability. I believe the CharXiv benchmark should be supported by VLMEvalKit. I am working on supporting this benchmark; any suggestions are welcome...