LoFTR: how to create ground truth?

Open trand2k opened this issue 1 year ago • 17 comments

Hi authors, thank you for your repo. I want to train your model on my custom dataset, and I have some questions:

1. What is the ground truth for your model? I see that you generate keypoint pairs using depth images; is that right?
2. My dataset doesn't have depth images. Can I label keypoint pairs myself and use them as ground truth?
3. Can you explain how to use depth images to find matching keypoints? Thanks for your help.

Mar 24 '23 04:03 trand2k

Check issue https://github.com/zju3dv/LoFTR/issues/243 for number 2. You will need to refactor a few parts of the code. You will not need the existing supervision in that case; instead you will need to build your own "supervisor". As long as your dataset has depth, you can build your own dataset class that fills in the important keys. Then the supervision does its job; you could also check the coarse_matching.py module. That's all I understand, I hope it helps (I'm not one of the authors, just one more enthusiast here). Edit: I forgot to mention that you will need to add your dataset class to the data.py flow, which is the data loader.
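
For number 3, the general idea of depth-based ground truth can be sketched as follows: unproject a pixel of image 0 with its depth, move the 3D point into camera 1 with the relative pose, and project it with the intrinsics of image 1. This is a minimal NumPy sketch and not the repo's own supervision code. The function names are hypothetical, and the poses are assumed to be 4x4 world-to-camera matrices.

```python
import numpy as np


def relative_pose(T0, T1):
    """Transform from camera 0 to camera 1, assuming T0 and T1 are
    4x4 world-to-camera matrices (an assumption; check your SfM tool)."""
    return T1 @ np.linalg.inv(T0)


def warp_points_with_depth(pts0, depth0, K0, K1, T_0to1):
    """Map pixels of image 0 into image 1 using depth and relative pose.

    pts0:   (N, 2) pixel coordinates (x, y), assumed to lie inside depth0
    depth0: (H, W) depth map of image 0, 0 where depth is missing
    K0, K1: (3, 3) intrinsics; T_0to1: (4, 4) rigid transform
    Returns pts1 (N, 2) and a boolean mask of usable correspondences.
    """
    pts0 = np.asarray(pts0, dtype=float)
    x = np.round(pts0[:, 0]).astype(int)
    y = np.round(pts0[:, 1]).astype(int)
    z = depth0[y, x]
    valid = z > 0  # pixels without depth cannot be supervised

    # Unproject to 3D points in the camera-0 frame.
    ones = np.ones((len(pts0), 1))
    rays = np.linalg.inv(K0) @ np.hstack([pts0, ones]).T  # (3, N)
    cam0 = rays * z

    # Move the points into the camera-1 frame.
    cam1 = T_0to1[:3, :3] @ cam0 + T_0to1[:3, 3:4]
    valid &= cam1[2] > 0  # points behind camera 1 are not visible

    # Project into image 1 (guard against division by zero).
    proj = K1 @ cam1
    denom = np.where(proj[2] != 0, proj[2], 1.0)
    pts1 = (proj[:2] / denom).T
    return pts1, valid


if __name__ == "__main__":
    K = np.array([[100.0, 0.0, 50.0], [0.0, 100.0, 50.0], [0.0, 0.0, 1.0]])
    depth = np.full((100, 100), 2.0)
    T1 = np.eye(4)
    T1[0, 3] = 1.0  # camera-1 frame shifted along x
    pts1, valid = warp_points_with_depth(
        [[50, 50]], depth, K, K, relative_pose(np.eye(4), T1)
    )
    print(pts1, valid)
```

In practice you would also discard points that land outside image 1, and a stricter version would compare the projected depth with the depth map of image 1 to reject occluded points; both checks are omitted here to keep the sketch short.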

Mar 27 '23 01:03 ACSL-ricardo

Thanks for your response, I already understand how to create ground truth. But building my own "supervisor" is a difficult task. If I label point pairs in the two images, some areas may be missing from the fine-level supervision. Does that affect the results?

Mar 27 '23 03:03 trand2k

It would probably affect your results. Take into consideration how the loss function is calculated: it uses both the coarse-level loss and the fine-level loss. I recommend you keep using RGB-D datasets that are easy to fit to the project, like https://cvg.cit.tum.de/data/datasets

Mar 27 '23 05:03 ACSL-ricardo

Have you tried training SuperPoint and SuperGlue for this task? It seems helpful for my case.

Mar 27 '23 06:03 trand2k

Have you successfully trained the LoFTR model using a custom dataset? I'm also in the process of training it with my own dataset, without using depth information. However, I've encountered some challenges in creating my dataset and understanding the training process. I'd like to ask you a few questions. I would greatly appreciate your assistance.

1. For our own dataset, how can we create the h5 (depth) and npz files needed for proper training? Could you provide guidance on creating npz files that contain the information for the five parameters?
2. In the context of LoFTR training, is it possible to exclude depth information, for example by not using the h5 (depth) files in the dataset?
3. If we wish to create a dataset for training, how should we modify the corresponding code?

Oct 18 '23 06:10 JiamuR

I have some points for you:

1. If you have video from a mono camera, you can use a library that supports structure from motion to generate a depth image and a pose for each image, and use these to train LoFTR.
2. Yes, if you have labeled point pairs between the two images. Note that LoFTR has two levels, a coarse level and a fine-grained level; I only trained the coarse level on my dataset. Also note that if you label the pairs yourself, then in the loss function you need to filter out all patches at the P/8 (coarse) level that have no labeled keypoint match before passing them to the cross-entropy loss.
3. My corresponding code belongs to my company and is confidential, but you can follow my instructions to train LoFTR. Good luck.
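
The filtering described in point 2 could look roughly like the sketch below. It is not the confidential code mentioned above and not the repo's loss. It assumes a plain row-wise cross-entropy over a coarse similarity matrix `scores` of shape (cells in image 0, cells in image 1), a stride of 8 between pixels and coarse cells, and hypothetical function names. Rows without a labeled match are simply left out of the loss instead of being treated as negatives.

```python
import numpy as np


def labeled_coarse_targets(pts0, pts1, w0_coarse, w1_coarse, stride=8):
    """Convert labeled pixel pairs (x, y) into flat coarse-cell indices.

    Returns (i_idx, j_idx): for each labeled pair, the cell index in
    image 0 and the matching cell index in image 1.
    """
    c0 = np.floor(np.asarray(pts0, dtype=float) / stride).astype(int)
    c1 = np.floor(np.asarray(pts1, dtype=float) / stride).astype(int)
    i_idx = c0[:, 1] * w0_coarse + c0[:, 0]  # row-major: y * width + x
    j_idx = c1[:, 1] * w1_coarse + c1[:, 0]
    return i_idx, j_idx


def masked_coarse_cross_entropy(scores, i_idx, j_idx):
    """Cross-entropy over labeled coarse cells only.

    scores: (N0, N1) similarity between coarse cells of both images.
    Cells of image 0 that have no label never enter the loss.
    """
    rows = scores[i_idx]                              # keep labeled rows only
    rows = rows - rows.max(axis=1, keepdims=True)     # numerical stability
    log_prob = rows - np.log(np.exp(rows).sum(axis=1, keepdims=True))
    return float(-log_prob[np.arange(len(i_idx)), j_idx].mean())


if __name__ == "__main__":
    # Two 16x16 images give 2x2 coarse grids at stride 8.
    i_idx, j_idx = labeled_coarse_targets(
        [[3, 3], [12, 12]], [[12, 3], [3, 12]], w0_coarse=2, w1_coarse=2
    )
    scores = np.zeros((4, 4))
    print(i_idx, j_idx, masked_coarse_cross_entropy(scores, i_idx, j_idx))
```

Leaving unlabeled cells out matters because a hand-labeled set is sparse: a cell without a label may still have a true match, so penalizing it as "no match" would teach the model wrong negatives.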

Oct 18 '23 07:10 trand2k

Thank you for your response; it helped a lot. I'm new to this field and have just started working on this project, so I have some questions. Thank you for listening. Here's my current situation: I plan to perform feature point matching between drone-captured images and satellite images to improve localization. I already have aerial images and corresponding satellite image data.

1. It's challenging for me to generate depth images with this setup. I'd like to train without depth information (without using the h5 depth files), but I'm not sure how to remove the depth information or what to consider during training. Is it even possible?
2. Can you provide more specific information on using a structure-from-motion library to generate image poses? How should I go about generating intrinsics, poses, and pair_infos for the npz file?
3. Once the npz file is prepared, does this mean the dataset is ready for training? Are there any additional considerations during the training process?

I greatly appreciate your guidance; your insights will help me gain a deeper understanding of this field.

Oct 18 '23 07:10 JiamuR

My work is similar to yours. You can discuss it with my boss; you can find our demo here: HERE

Oct 18 '23 07:10 trand2k

MY ANSWER:

1. Yes, it is possible.
2. With drone images, try OpenDroneMap. You need to start by debugging OpenDroneMap: do a native build and debug it step by step.
3. Yes. Mono images, depth images, and the poses of the two cameras are all you need to train LoFTR.
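
Once images, depth maps, and poses are available, packing them into an npz index might look like the sketch below. The thread only names intrinsics, poses, and pair_infos; the keys `image_paths` and `depth_paths`, the layout of each `pair_infos` entry, the world-to-camera pose convention, and all function names are assumptions made for illustration, so check them against the dataset class in the repo before relying on this.

```python
import numpy as np


def build_scene_info(image_paths, depth_paths, intrinsics, poses, pairs):
    """Pack per-image data and a list of (idx0, idx1, overlap) pairs.

    intrinsics: list of (3, 3) arrays, one per image
    poses:      list of (4, 4) arrays, assumed world-to-camera
    """
    pair_infos = np.empty(len(pairs), dtype=object)
    for n, (idx0, idx1, overlap) in enumerate(pairs):
        # Assumed entry layout: image indices, overlap score, and a
        # third slot that this sketch leaves empty.
        pair_infos[n] = ((idx0, idx1), float(overlap), None)
    return {
        "image_paths": np.array(image_paths),
        "depth_paths": np.array(depth_paths),
        "intrinsics": np.stack(intrinsics).astype(np.float64),  # (N, 3, 3)
        "poses": np.stack(poses).astype(np.float64),            # (N, 4, 4)
        "pair_infos": pair_infos,
    }


def save_scene_info(path, scene_info):
    """Write the index to an .npz file."""
    np.savez(path, **scene_info)


def load_scene_info(path):
    """Read the index back; object arrays need allow_pickle=True."""
    with np.load(path, allow_pickle=True) as data:
        return {key: data[key] for key in data.files}


if __name__ == "__main__":
    K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
    info = build_scene_info(
        ["img/0.jpg", "img/1.jpg"],
        ["depth/0.h5", "depth/1.h5"],
        [K, K],
        [np.eye(4), np.eye(4)],
        [(0, 1, 0.6)],
    )
    save_scene_info("scene_example.npz", info)
    print(load_scene_info("scene_example.npz")["pair_infos"][0])
```

The paths in the usage part are placeholders. Writing the npz is only the index step: the dataset class still has to read the images and depth files it points to.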

Oct 18 '23 07:10 trand2k

Thank you for your reply. Your answer is very helpful to me. I can't get to your demo site right now, but thank you very much for your advice. If there is any follow-up, I will contact you. Thank you.

Oct 18 '23 07:10 JiamuR
