AF-SfMLearner outputs["position_depth", scale, frame

outputs["position_depth", scale, frame_id]的维度问题

Open nanasylum opened this issue 10 months ago • 1 comments

很感谢您所做的工作！

我在您网络的基础上试图改进时，想使用tgt_depth和ref_depth这两个变量。根据我对train_stage_two.py的理解，tgt_depth应该是指outputs["depth", 0, scale]，ref_depth应该是指outputs["position_depth", scale, frame_id]。但是我却发现outputs["position_depth", scale, frame_id]的shape是[12,2,256,320]。为什么第二个维度是2？不应该是3吗？

期待您的回复

Apr 05 '24 09:04 nanasylum

还有一个问题，请问train_stage_two.py文件中，outputs["position_depth"]和outputs["depth"]有什么区别吗？

Line421 我的理解是用视差计算深度

            _, depth = disp_to_depth(disp, self.opt.min_depth, self.opt.max_depth)

            outputs[("depth", 0, scale)] = depth

Line457 但是这里就不太理解，输入相机的内参可以怎么样？

                outputs[("position_depth", scale, frame_id)] = self.position_depth[source_scale](
                        cam_points, inputs[("K", source_scale)], T)

期待您的回复

Apr 06 '24 08:04 nanasylum

AF-SfMLearner AF-SfMLearner copied to clipboard

outputs["position_depth", scale, frame_id]的维度问题

AF-SfMLearner
AF-SfMLearner copied to clipboard