Remote-Sensing-RVSA issues

Value angle transformation issue in the code

1

The code currently reads `sampling_angle_v = self.sampling_angles_v(x)` followed by `sampling_angle_v = sampling_angle_k.reshape(num_predict_total, 1, window_num_h, window_num_w)`. Shouldn't it instead be `sampling_angle_v = self.sampling_angles_v(x)` followed by `sampling_angle_v = sampling_angle_v.reshape(num_predict_total, 1, window_num_h, window_num_w)`?

jiaotiancai
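
To make the question above concrete, here is a minimal sketch of what the two variants do. It does not reproduce the repository's module: the two angle heads, the tensor sizes, and the helper `make_angle_head` are hypothetical stand-ins, and only the variable names in the two reshape lines come from the issue. Under those assumptions, the reported line returns a copy of the key angles, while the proposed line keeps the value branch's own prediction.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical sizes, chosen only for illustration.
batch, channels, height, width = 2, 8, 4, 4
num_heads = 2
window_num_h, window_num_w = 2, 2
num_predict_total = batch * num_heads


def make_angle_head():
    # Stand-in for an angle-prediction branch (assumption, not the repo's layer):
    # pool to one cell per window, then predict one angle per head.
    return nn.Sequential(
        nn.AdaptiveAvgPool2d((window_num_h, window_num_w)),
        nn.Conv2d(channels, num_heads, kernel_size=1),
    )


sampling_angles_k = make_angle_head()
sampling_angles_v = make_angle_head()

x = torch.randn(batch, channels, height, width)

sampling_angle_k = sampling_angles_k(x).reshape(
    num_predict_total, 1, window_num_h, window_num_w
)

# Line as reported in the issue: the value prediction is computed,
# then replaced by a reshape of the key tensor.
sampling_angle_v = sampling_angles_v(x)
buggy_v = sampling_angle_k.reshape(num_predict_total, 1, window_num_h, window_num_w)

# Line as proposed in the issue: the value tensor itself is reshaped.
fixed_v = sampling_angle_v.reshape(num_predict_total, 1, window_num_h, window_num_w)

print("reported line equals key angles:", torch.equal(buggy_v, sampling_angle_k))
print("proposed line equals key angles:", torch.equal(fixed_v, sampling_angle_k))
```

Whether the value branch is meant to have independent angles is for the maintainers to confirm; the sketch only shows that the two lines are not equivalent.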

load pretrained backbone weights?

1

Hello, if possible, could you provide an example of how to load the pretrained weights of, for example, the ViTAE-B backbone on the pretraining model trained on the...

ale93111
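
As a starting point for the request above, the following is a generic PyTorch sketch of loading backbone weights from a checkpoint file. It is not the repository's loader: `TinyBackbone`, `load_backbone_weights`, the `"backbone."` prefix, and the `"model"`/`"state_dict"` wrapper keys are assumptions made for illustration, and the checkpoint is created on the fly so the example runs on its own.

```python
import os
import tempfile

import torch
import torch.nn as nn


class TinyBackbone(nn.Module):
    """Hypothetical stand-in for a real backbone such as ViTAE-B."""

    def __init__(self):
        super().__init__()
        self.patch_embed = nn.Linear(16, 32)
        self.block = nn.Linear(32, 32)


def load_backbone_weights(model, checkpoint_path, prefix="backbone."):
    """Load matching weights into `model` and report keys that did not match."""
    checkpoint = torch.load(checkpoint_path, map_location="cpu")

    # Assumption: weights may be nested under "state_dict" or "model".
    state_dict = checkpoint
    for key in ("state_dict", "model"):
        if isinstance(checkpoint, dict) and key in checkpoint:
            state_dict = checkpoint[key]
            break

    # Strip an optional prefix so the keys line up with the bare backbone.
    state_dict = {
        (k[len(prefix):] if k.startswith(prefix) else k): v
        for k, v in state_dict.items()
    }

    # strict=False tolerates extra or missing keys and reports them.
    result = model.load_state_dict(state_dict, strict=False)
    return result.missing_keys, result.unexpected_keys


# Build a fake "pretrained" checkpoint so the sketch is self-contained.
pretrained = TinyBackbone()
fake_checkpoint = {
    "model": {f"backbone.{k}": v for k, v in pretrained.state_dict().items()}
}
fake_checkpoint["model"]["decoder.weight"] = torch.zeros(1)  # extra, non-backbone key

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "pretrained.pth")
    torch.save(fake_checkpoint, path)
    model = TinyBackbone()
    missing, unexpected = load_backbone_weights(model, path)

print("missing keys:", missing)
print("unexpected keys:", unexpected)
```

Inspecting the missing and unexpected keys is the quickest way to see whether a real checkpoint uses a different prefix or wrapper key than the ones assumed here.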

Accuracy discrepancy when running inference with the potsdam_vitae_rvsa_kvidff weights

4

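Hello, when I run inference with the potsdam_vitae_rvsa_kvidff.pth weights provided in the RVSA repository, my results differ somewhat from the reported ones. I did not modify the config file apart from data_root. My results are as follows: ![image](https://github.com/ViTAE-Transformer/Remote-Sensing-RVSA/assets/77008579/9e9c1e4e-5060-4767-bbf4-14a19f532f6e) The log in the RVSA repository reports "aAcc": 0.9115, "mIoU": 0.8307, "mAcc": 0.9005, "mFscore": 0.9061, "mPrecision": 0.9124, "mRecall": 0.9005. All metrics differ by about 0.3%, and I am not sure whether the two results can be considered aligned. I built my test set by splitting '2_Ortho_RGB.zip' and '5_Labels_all.zip' with the official mmseg script, which yields 2016 test images of 512x512. I suspect the difference is related to the test-set split, yet when I run inference with the rsp_r50 weights the accuracy is essentially aligned. https://github.com/ViTAE-Transformer/RSP/issues/15 I really do not understand what causes this and look forward to your reply.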

youngbaldy
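
One way to make "aligned or not" explicit is to compare the metrics against a chosen tolerance. The sketch below uses the reference values quoted from the repository log; the reproduced values are placeholders set 0.003 below the reference, because the actual numbers are only available in the screenshot. The helper `compare_metrics` and both tolerance values are hypothetical.

```python
# Reference values quoted from the repository log in the issue above.
reference = {
    "aAcc": 0.9115,
    "mIoU": 0.8307,
    "mAcc": 0.9005,
    "mFscore": 0.9061,
    "mPrecision": 0.9124,
    "mRecall": 0.9005,
}

# Placeholder values, NOT the numbers from the screenshot: each metric is
# set 0.003 below the reference to mimic the reported gap of about 0.3%.
reproduced = {name: round(value - 0.003, 4) for name, value in reference.items()}


def compare_metrics(reference, reproduced, tolerance):
    """Return {metric: (difference, within_tolerance)} for every reference metric."""
    report = {}
    for name, ref_value in reference.items():
        diff = reproduced[name] - ref_value
        report[name] = (diff, abs(diff) <= tolerance)
    return report


loose = compare_metrics(reference, reproduced, tolerance=0.005)
tight = compare_metrics(reference, reproduced, tolerance=0.001)

for name, (diff, ok) in loose.items():
    print(f"{name:>10}: diff={diff:+.4f} within 0.005: {ok}")
```

The appropriate tolerance depends on how much variation the test-set split and tiling can introduce, which the sketch does not attempt to estimate.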

Reproduced mAP on the DOTA test set is too low

4

I ran inference on the DOTA test set using the trained model and config file provided on the repository home page, but the resulting mAP is only 0.39. The model I used: ![image](https://github.com/user-attachments/assets/9e9957d2-3045-40ad-bbd8-72bd91a64813) Config file: ![image](https://github.com/user-attachments/assets/9bdaaef3-85c4-4eb2-9d09-692f91a04944) Test command: CUDA_VISIBLE_DEVICES=0 python /home/pp/OBBDetection/tools/test.py '/home/pp/Remote-Sensing-RVSA/Object Detection/configs/obb/oriented_rcnn/vit_base_win/faster_rcnn_orpn_our_rsp_vitae-nc-base-win-rvsa_v3_kvdiff_wsz7_fpn_1x_dota10_lr1e-4_ldr75_dpr15.py' '/home/pp/Model/vite_b_RSVA_kvdiff_ls/vitae_rvsa_kvdiff.pth' --format-only --show-dir '/home/pp/Model/vite_b_RSVA_kvdiff_ls/work_dirs/save' --options save_dir='/home/pp/Model/vite_b_RSVA_kvdiff_ls/work_dirs/save_val' nproc=1 Result returned by the official DOTA evaluation: ![test](https://github.com/user-attachments/assets/9fb3d488-c251-4a08-87fe-55f3d073da81)

PP-explore

Cannot find the tools/train file when training segmentation

19

Where is the tools/train file used for training segmentation?

optimus20

What do kvdiff and wsz7 in the file names mean?

1

WenLinLliu

Loading pretrained weights for ViTAE_NC_Win_RVSA_V3_WSZ7

1

When I load vitae-b-checkpoint-1599-transform-no-average.pth, the following error occurs (the dataset I am using is potsdam): Error(s) in loading state_dict for ViTAE_NC_Win_RVSA_V3_WSZ7: size mismatch for pos_embed: copying a param with shape torch.Size([1, 197, 768]) from checkpoint, the shape in current model is torch.Size([1, 1024, 768])....

Dawn-creat
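
The shapes in the error above suggest a checkpoint trained with a class token plus a 14x14 patch grid (1 + 196 = 197 tokens) and a model that expects a 32x32 grid with no class token (1024 tokens). That reading is an inference from the numbers, not something the issue confirms. A common workaround in ViT codebases is to interpolate the positional embedding before loading; the sketch below shows the idea with a random tensor in place of the real checkpoint entry. The function `resize_pos_embed` and its arguments are hypothetical, and whether the repository already performs this step in its own loader is not stated here.

```python
import torch
import torch.nn.functional as F


def resize_pos_embed(pos_embed, new_grid, num_extra_tokens=1):
    """Interpolate a (1, N, C) positional embedding to a new square patch grid.

    Assumes the first `num_extra_tokens` entries (e.g. a class token) are
    dropped and the remaining tokens form a square grid.
    """
    patch_tokens = pos_embed[:, num_extra_tokens:]
    old_len, dim = patch_tokens.shape[1], patch_tokens.shape[2]
    old_grid = int(round(old_len ** 0.5))
    if old_grid * old_grid != old_len:
        raise ValueError(f"{old_len} patch tokens do not form a square grid")

    # (1, N, C) -> (1, C, H, W) so that 2D interpolation can be applied.
    grid = patch_tokens.reshape(1, old_grid, old_grid, dim).permute(0, 3, 1, 2)
    grid = F.interpolate(
        grid, size=(new_grid, new_grid), mode="bicubic", align_corners=False
    )
    # (1, C, H', W') -> (1, H' * W', C)
    return grid.permute(0, 2, 3, 1).reshape(1, new_grid * new_grid, dim)


# Random stand-in for the checkpoint's pos_embed entry.
checkpoint_pos_embed = torch.randn(1, 197, 768)
resized = resize_pos_embed(checkpoint_pos_embed, new_grid=32)
print(resized.shape)  # torch.Size([1, 1024, 768])
```

The resized tensor would then replace the `pos_embed` entry in the state dict before calling `load_state_dict`.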

Have the ViT-B backbone weights from the paper been released?

2

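Are the weights and code publicly available for the setting in the paper that uses ViT-B as the backbone, oriented rcnn as the detection method, and MAE pretraining?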

WenLinLliu

Using the pretrained model for segmentation tasks

6

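Hello, I would like to ask how you obtained the ViT-B + RVSA weights used in the segmentation task from the pretrained ViT-B or ViTAE-B backbone weights. Where is the code for this step? In other words, if I use your pretrained ViT-B or ViTAE-B weights directly, how should I process them so that they can be used in the pretrained= field in mmsegmentation? I look forward to your reply. Thank you 🙏.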

chenyuabc10

OneDrive download links for the models vit_rvsa.pth, vitae_rvsa.pth, vit_rvsa_kvdiff.pth, and vitae_rvsa_kvdiff.pth are broken

1

![image](https://github.com/user-attachments/assets/6d1bd735-047e-480f-ba70-b95b4c4bded8)

biggg2
