tiesanguaixia issues

Results 15 issues of


tiesanguaixia

Why I met this error "IndentationError: unexpected indent" when running the 2nd step of "training"?

I ran this code in README.md ``` usage: train.py [-h] [--backbone {resnet,xception,drn,mobilenet}] [--out-stride OUT_STRIDE] [--dataset {pascal,coco,cityscapes}] [--use-sbd] [--workers N] [--base-size BASE_SIZE] [--crop-size CROP_SIZE] [--sync-bn SYNC_BN] [--freeze-bn FREEZE_BN] [--loss-type {ce,focal}] [--epochs...

[BLIP 2] How to put the pretrained Q-Former in BLIP-2 into my own model?

Thank you for your great work! I also want to ask how can I load the weight of _Q-Former_ weight only? Is the any code for reference? Looking forward to...

How to train and evaluate the model on the Training-7k split?

Thank you for your great work!

您好，急问为什么执行inference demo时，对一定数量的视频生成caption之后程序就会卡死

我把所有视频放在列表里面去遍历，依次生成caption. 但十几个视频之后程序就会卡在输出caption的answer那里。请问这是为什么呢？非常感谢！

【急问】PaddleOCR对单张图片做文字识别，设置use_gpu=True的情况下出结果要大致15秒。这个速度是正常的吗？

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem 【没有报错】 - 系统环境/System Environment：**CUDA 11** - 版本号/Version：Paddle：**paddlepaddle==2.5.2** PaddleOCR：**2.7.2** 问题相关组件/Related components： - 运行指令/Command Code： - 完整报错/Complete Error Message：请尽量不要包含图片在问题中/Please try to...

Error(s) in loading state_dict for VideoLLAMA

Hi, I am running the demo with only VL branch, I set the checkpoint path like: ```shell llama_model: "model_weights/vicuna_final/" ckpt: '/home/ubuntu/Documents/Video-LLaMA/model_weights/Pre-trained_Visual_Encoder/pretrained_minigpt4.pth' # you can use our pretrained ckpt from https://huggingface.co/DAMO-NLP-SG/Video-LLaMA-2-13B-Pretrained/...

What is the learnable queries like BLIP2 to serve as the input of video Q-former?

Thank you in advance!

example_chat.py does not support multi-images as input?

Thank you for the favorable work! [Inference on Multiple GPUs](https://github.com/InternLM/InternLM-XComposer?tab=readme-ov-file#inference-on-multiple-gpus) in README calls [example_chat.py](https://github.com/InternLM/InternLM-XComposer/blob/main/examples/example_chat.py), but it seems like the code does not support multi-images as input. When I organize 2...

How can I annotate the Foreground indicator, Boundary offsets, and Saliency score on my own moment retrieval dataset?

Thank you for the wonderful work! I want to finetune UniVTG on my moment retrieval dataset. I wonder how do authors re-annotate the original json files of moment retrieval datasets...

How to use Osprey-Chat to generate a short description for all masks of an image?

Thank you for the amazing work! I saw the Offline demo you introduced, but it seems that this process still requires manual clicking on specific masks to generate a specific...