Chunjiang Ge (葛春江)
However, Vicuna does not release the training data.
LLaMA-7B has 128 dims per head, while flash-attn only supports head dim 64 on the RTX 3090. So LLaMA-7B with flash-attn may only run on an A100 or H100.
> Are you sure? flash-attn v2 supports dim up to 256. I am able to use it on a 3090.
>
> > FlashAttention-2 currently supports:
> >
> > 1. ...
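For reference, here is one quick way to check this on a given GPU: a minimal sketch that calls FlashAttention-2 directly with LLaMA-7B's head dim (4096 hidden / 32 heads = 128). It assumes flash-attn v2 is installed and will simply raise an error if the head dim is unsupported on that card.

```python
# Minimal check: run FlashAttention-2's forward pass with head_dim = 128,
# matching LLaMA-7B (hidden 4096 / 32 heads). Requires fp16/bf16 CUDA tensors.
import torch
from flash_attn import flash_attn_func

batch, seqlen, n_heads, head_dim = 1, 512, 32, 128
q = torch.randn(batch, seqlen, n_heads, head_dim, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True)  # raises if head_dim is unsupported
print(out.shape)  # (1, 512, 32, 128)
```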
> Hello, thank you for your interest in the dialogue dataset. Unfortunately, it has not been released publicly yet. If researchers want to use it, they need to go through the Zhiyuan (BAAI) platform; for details, please contact [[email protected]](mailto:[email protected]), and you will receive detailed usage instructions. Thank you!

Hello, I have sent you an email; please reply when you get a chance.
There is a large gap between the validation accuracy reported by VLMEvalKit on this dataset and the accuracy reported in the model's paper.
Hello, I find that for the TextVQA dataset, LLaVA evaluates with reference OCR tokens included in the prompt, like: What kind of beer is this?\nReference OCR token: NINK, NK, BOWING, CC, STON, SUE, ED, Sublimely, ...
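For clarity, a tiny sketch of how such a prompt string is assembled from the question and the OCR tokens; the exact wording in LLaVA's eval script may differ.

```python
# Hedged sketch of the TextVQA prompt format quoted above (not the actual eval code).
question = "What kind of beer is this?"
ocr_tokens = ["NINK", "NK", "BOWING", "CC", "STON", "SUE", "ED", "Sublimely"]
prompt = f"{question}\nReference OCR token: {', '.join(ocr_tokens)}"
print(prompt)
```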
Thanks for your reply! I would like to know what the normal format is for inference with batch size > 1. Should we deploy the model through something like vLLM, or...
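For what it's worth, here is a minimal sketch of plain batched generation with HuggingFace transformers (left padding, pad token set to EOS) as an alternative to a serving stack like vLLM; the model name is just a placeholder.

```python
# Batched greedy decoding with left padding; swap in your own checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-hf"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name, padding_side="left")
tokenizer.pad_token = tokenizer.eos_token  # causal LMs often lack a pad token
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16).to("cuda")

prompts = ["What is shown in the image?", "Describe the scene."]
inputs = tokenizer(prompts, return_tensors="pt", padding=True).to("cuda")
out = model.generate(**inputs, max_new_tokens=64, do_sample=False)

# Strip the prompt tokens before decoding the new text.
new_tokens = out[:, inputs["input_ids"].shape[1]:]
print(tokenizer.batch_decode(new_tokens, skip_special_tokens=True))
```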
https://github.com/haotian-liu/LLaVA/issues/754#issuecomment-1907970439 This issue builds a fast inference method for LLaVA; would you add this feature for every benchmark in this repo? BTW, I find that SGLang may not support LoRA + base model....
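One possible workaround, sketched below under the assumption that the adapter was trained with peft: merge the LoRA weights into the base model first, then serve the merged checkpoint as an ordinary model (paths are placeholders).

```python
# Fold LoRA deltas into the base weights so the result needs no adapter at serve time.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("path/to/base-model")      # placeholder
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")        # placeholder
merged = model.merge_and_unload()  # merges LoRA weights into the base model
merged.save_pretrained("path/to/merged-model")
```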
You could try registering it in the dassl package, since this repo depends on dassl. What's more, I think recent versions of dassl support the VLCS dataset; you could refer to https://github.com/KaiyangZhou/Dassl.pytorch.
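For illustration, a rough sketch of the dassl registration pattern; `MyVLCS`, the labels, and the paths here are hypothetical, and the real dataset classes in Dassl.pytorch do considerably more work.

```python
# Register a custom dataset so dassl can build it from cfg.DATASET.NAME.
from dassl.data.datasets import DATASET_REGISTRY, DatasetBase, Datum


@DATASET_REGISTRY.register()
class MyVLCS(DatasetBase):
    dataset_dir = "vlcs"  # hypothetical folder under cfg.DATASET.ROOT

    def __init__(self, cfg):
        # In practice you would scan cfg.DATASET.ROOT / self.dataset_dir here
        # and build proper train/val/test splits of Datum objects.
        train = [Datum(impath="path/to/img.jpg", label=0, classname="bird")]
        super().__init__(train_x=train, val=train, test=train)
```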
Could you please elaborate on what kinds of text data and what kinds of tasks?
From the paper, it seems the Qwen template is used for training during the pretraining stage, and there is no LLaVA-style pretraining stage that uses the plain template, right?
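To make the question concrete, here is a hedged illustration of the two styles being contrasted; the token strings are examples, not the project's actual templates.

```python
# Two pretraining prompt styles for an image-caption pair (illustrative only).
caption = "a photo of a dog"

# LLaVA-style "plain" pretraining template: image tokens followed by the caption.
plain = f"<image>\n{caption}"

# Qwen chat-style (ChatML) template: full conversation markup even during pretraining.
qwen_chat = (
    "<|im_start|>user\n<image>\nDescribe the image.<|im_end|>\n"
    f"<|im_start|>assistant\n{caption}<|im_end|>"
)
```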