waymo-open-dataset icon indicating copy to clipboard operation
waymo-open-dataset copied to clipboard

Objects formed by reflection effect.

Open XuyangBai opened this issue 3 years ago • 2 comments

Hi, Thanks for sharing this great dataset. For scenes with large glass, there usually exist some LiDAR points behind the glass maybe due to the reflection, and sometimes the points form well-shaped objects. I think such objects could harm the training process as they are not labelled as ground-truth boxes but can not be easily distinguished from ground-truth boxes without the maps or the road structure information. The following are examples:

519026_lidar_Inked

519026_img3

519026_lidar1

You can see there are three well-shaped cars behind the building, which I assume is the reflection of cars on the right side of the ego vehicle. And I use the preprocessing tools of mmdetection3d and I think this frame is from the training set, 519th segment, 26th frame.

The following example is from the validation set, 91st segment, 13th frame.

1091013_lidar

1091013_img3

I am not sure how many frames contain such cases, just wondering do you have any suggestions to avoid the potential negative effects of such reflected objects?

Best, Xuyang

XuyangBai avatar Nov 14 '21 07:11 XuyangBai

Detectors are supposed to understand them based on context and lidar point properties (e.g. second return).

peisun1115 avatar Nov 14 '21 22:11 peisun1115

Hi, Thanks a lot for your reply. It makes sense to me that the detectors have the potential to distinguish the reflected objects from the real objects based on context (i.e. there aren't any ground points for reflected cars). But could you explain more details about the second strategy(lidar point cloud properties)? What is the principle of distinguishing reflected objects using the two returns? Are the reflected points usually formed by the second return?

Moreover, what is the recommended way to combine lidar points from first and second returns? In mmdetection3d, I find points from two returns are simply concatenated without adding an indicator value. In this way, I think the network is unable to leverage the information about first and second returns.

Looking forward to your explanation.

Best, Xuyang

XuyangBai avatar Nov 25 '21 07:11 XuyangBai