tiesanguaixia

Results 15 issues of tiesanguaixia

I ran this code in README.md ``` usage: train.py [-h] [--backbone {resnet,xception,drn,mobilenet}] [--out-stride OUT_STRIDE] [--dataset {pascal,coco,cityscapes}] [--use-sbd] [--workers N] [--base-size BASE_SIZE] [--crop-size CROP_SIZE] [--sync-bn SYNC_BN] [--freeze-bn FREEZE_BN] [--loss-type {ce,focal}] [--epochs...

Thank you for your great work! I also want to ask how can I load the weight of _Q-Former_ weight only? Is the any code for reference? Looking forward to...

我把所有视频放在列表里面去遍历,依次生成caption. 但十几个视频之后程序就会卡在输出caption的answer那里。请问这是为什么呢?非常感谢!

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem 【没有报错】 - 系统环境/System Environment:**CUDA 11** - 版本号/Version:Paddle:**paddlepaddle==2.5.2** PaddleOCR:**2.7.2** 问题相关组件/Related components: - 运行指令/Command Code: - 完整报错/Complete Error Message: 请尽量不要包含图片在问题中/Please try to...

Hi, I am running the demo with only VL branch, I set the checkpoint path like: ```shell llama_model: "model_weights/vicuna_final/" ckpt: '/home/ubuntu/Documents/Video-LLaMA/model_weights/Pre-trained_Visual_Encoder/pretrained_minigpt4.pth' # you can use our pretrained ckpt from https://huggingface.co/DAMO-NLP-SG/Video-LLaMA-2-13B-Pretrained/...

Thank you for the favorable work! [Inference on Multiple GPUs](https://github.com/InternLM/InternLM-XComposer?tab=readme-ov-file#inference-on-multiple-gpus) in README calls [example_chat.py](https://github.com/InternLM/InternLM-XComposer/blob/main/examples/example_chat.py), but it seems like the code does not support multi-images as input. When I organize 2...

Thank you for the wonderful work! I want to finetune UniVTG on my moment retrieval dataset. I wonder how do authors re-annotate the original json files of moment retrieval datasets...

Thank you for the amazing work! I saw the Offline demo you introduced, but it seems that this process still requires manual clicking on specific masks to generate a specific...