`lm_eval --model local-chat-completions --tasks gpqa_main_cot_zeroshot --model_args model=Qwen/Qwen2-72B-Instruct,base_url=https://api.together.xyz/v1 --output_path ./gpqa/result/Qwen2 --use_cache ./gpqa/cache/Qwen2 --log_samples --limit 10 --gen_kwargs temperature=0.7,max_tokens=8192` With this command, Qwen2's outputs just end abruptly, like the image below...
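One way to narrow this down is to check whether the truncation happens on the API side rather than inside lm-eval. Below is a minimal debugging sketch, assuming the `openai` Python client and a `TOGETHER_API_KEY` environment variable (neither is part of the harness command above): send one prompt directly to the same endpoint with the same sampling parameters and inspect `finish_reason`.

```python
# Debugging sketch (not part of lm-eval): call the OpenAI-compatible
# endpoint directly and inspect finish_reason to see whether the reply
# is being cut off by the API (e.g. hitting max_tokens) or by the harness.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.together.xyz/v1",
    api_key=os.environ["TOGETHER_API_KEY"],
)

resp = client.chat.completions.create(
    model="Qwen/Qwen2-72B-Instruct",
    messages=[{"role": "user", "content": "Answer step by step: ... (paste one GPQA prompt here)"}],
    temperature=0.7,
    max_tokens=8192,
)

choice = resp.choices[0]
# "length" means the reply hit max_tokens; "stop" means a stop sequence
# or the model's own end-of-text token ended the generation.
print(choice.finish_reason)
print(choice.message.content[-500:])
```

If `finish_reason` already looks wrong here, the issue is with the endpoint or the sampling parameters rather than with lm-eval's post-processing.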
I'm testing a model that isn't among the supported VLMs. Is there a solution, or code I can modify, to make the evaluation fit my own model?
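If this is lm-evaluation-harness (or a fork with the same structure), unsupported models are usually added by subclassing the `LM` base class and registering it under a new name. A rough sketch follows; the import paths and method names can differ between versions, and the wrapper class and its `load_my_model` loader are hypothetical placeholders.

```python
# Sketch of wiring a custom model into lm-evaluation-harness.
# Exact module paths / signatures may differ by version; load_my_model
# and MyCustomVLM are hypothetical stand-ins for your own code.
from lm_eval.api.model import LM
from lm_eval.api.registry import register_model


@register_model("my-custom-vlm")
class MyCustomVLM(LM):
    def __init__(self, checkpoint_path: str, **kwargs) -> None:
        super().__init__()
        self.model = load_my_model(checkpoint_path)  # hypothetical loader

    def generate_until(self, requests):
        # Each request typically carries (context, generation kwargs);
        # return one generated string per request, in order.
        return [self.model.generate(req.args[0], **req.args[1]) for req in requests]

    def loglikelihood(self, requests):
        raise NotImplementedError("only generative tasks are handled in this sketch")

    def loglikelihood_rolling(self, requests):
        raise NotImplementedError
```

With something like this registered, the CLI can then be pointed at it, e.g. `lm_eval --model my-custom-vlm --model_args checkpoint_path=... --tasks ...`.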
Three days ago everything was fine, but when I reran it today things changed: it can no longer generate good video.
With Emu2 I only see multimodal input with text output. Is there a way to generate text plus multiple images?
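As a sketch of how interleaved output is usually assembled when a model exposes separate text and image generation calls: generate one segment at a time and feed each result back into the running context. The `generate_text` and `generate_image` helpers below are hypothetical placeholders, not actual Emu2 API calls.

```python
# Hypothetical sketch of interleaved text + multi-image generation by
# looping over segments; generate_text / generate_image stand in for
# whatever calls the model actually exposes.
def generate_interleaved(model, prompt, num_images):
    context = prompt
    outputs = []
    for i in range(num_images):
        text = model.generate_text(context)           # hypothetical call
        image = model.generate_image(context + text)  # hypothetical call
        outputs.append((text, image))
        # Append the new text (and a placeholder token for the image) so
        # the next segment is conditioned on everything produced so far.
        context = context + text + f"<image_{i}>"
    return outputs
```

Looping one segment at a time like this is also the usual way to build a multi-scene story, since each image call then sees the accumulated context.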
I've noticed you mentioned that interleaved text-image generation only works with the instruction-tuned model. Would you release it in the future, or could you please share some instruction-tuning methods?
If I want to fine-tune the model on my own dataset, which files should I modify? And how can I make the model generate multiple images in one turn?
Does the model require further fine-tuning? I'm wondering why the playground uses a 'for' loop to generate a story.