TITAN-NET
In this work, we present a simple yet effective framework to address the domain translation problem between different sensor modalities with unique data formats. By relying only on the semantics of the scene, our modular generative framework can, for the first time, synthesize a panoramic color image from a given full 3D LiDAR point cloud. The framework starts with semantic segmentation of the point cloud, which is initially projected onto a spherical surface. The same semantic segmentation is applied to the corresponding camera image. Next, our new conditional generative model adversarially learns to translate the predicted LiDAR segment maps into their camera image counterparts. Finally, the generated image segments are processed to render the panoramic scene images. We provide a thorough quantitative evaluation on the SemanticKITTI dataset and show that our proposed framework outperforms other strong baseline models.
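The spherical projection mentioned above is the standard range-image representation used by projection-based LiDAR segmentation networks such as our SalsaNext. Below is a minimal sketch of that projection; the function name, image resolution, and the HDL-64E field-of-view defaults are illustrative assumptions, not the repository's actual code.

```python
import numpy as np

def spherical_projection(points, h=64, w=2048, fov_up=3.0, fov_down=-25.0):
    """Project an (N, 4) LiDAR scan (x, y, z, remission) onto an (h, w)
    range image. Hypothetical helper: the defaults roughly match the
    Velodyne HDL-64E used in SemanticKITTI."""
    fov_up_rad = np.radians(fov_up)
    fov_down_rad = np.radians(fov_down)
    fov = abs(fov_up_rad) + abs(fov_down_rad)

    depth = np.linalg.norm(points[:, :3], axis=1)
    points = points[depth > 0]          # drop degenerate points at the origin
    depth = depth[depth > 0]
    x, y, z = points[:, 0], points[:, 1], points[:, 2]

    yaw = -np.arctan2(y, x)             # azimuth in [-pi, pi]
    pitch = np.arcsin(z / depth)        # elevation

    u = 0.5 * (yaw / np.pi + 1.0)                    # horizontal in [0, 1]
    v = 1.0 - (pitch + abs(fov_down_rad)) / fov      # vertical in [0, 1]

    u = np.clip(np.floor(u * w), 0, w - 1).astype(np.int32)
    v = np.clip(np.floor(v * h), 0, h - 1).astype(np.int32)

    # Write points in order of decreasing depth so that, when several
    # points fall into the same pixel, the closest one wins.
    order = np.argsort(depth)[::-1]
    image = np.full((h, w), -1.0, dtype=np.float32)
    image[v[order], u[order]] = depth[order]
    return image
```

The resulting 2D range image can then be fed to an image-style segmentation network, whose predicted segment map conditions the generative model.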
Example Videos
The following videos show the output of TITAN-NET: the generated RGB panoramic images.
In the following examples, we show how the semantic segmentation maps can easily be exploited to perform data augmentation across datasets (KITTI and Cityscapes); a sketch of the idea is given below.
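Because the generator is conditioned purely on semantic segmentation maps, any compatible label map can be fed in, which is what makes this kind of cross-dataset augmentation possible. The following is a hypothetical, minimal sketch of that idea; ToyGenerator, the class count, and the resolution are placeholders standing in for the trained TITAN-NET generator, not this repository's API.

```python
import torch
import torch.nn as nn

# Placeholder network: the real translation model is an adversarially
# trained conditional generator; this tiny module only makes the
# sketch runnable end to end.
class ToyGenerator(nn.Module):
    def __init__(self, num_classes=20):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(num_classes, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 3, kernel_size=3, padding=1),
            nn.Tanh(),  # RGB output in [-1, 1]
        )

    def forward(self, seg_map):
        return self.net(seg_map)

# One-hot semantic map (batch, classes, H, W). For augmentation it can
# come from another dataset, e.g. Cityscapes labels remapped to the
# SemanticKITTI class set, rather than from the LiDAR branch.
num_classes = 20
seg_map = torch.zeros(1, num_classes, 64, 512)
seg_map[:, 9] = 1.0  # placeholder map: a single class everywhere

generator = ToyGenerator(num_classes).eval()
with torch.no_grad():
    rgb = generator(seg_map)  # synthesized color image, (1, 3, 64, 512)
print(rgb.shape)
```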
Citation
@misc{cortinhal2021semanticsaware,
      title={Semantics-aware Multi-modal Domain Translation: From LiDAR Point Clouds to Panoramic Color Images},
      author={Tiago Cortinhal and Fatih Kurnaz and Eren Aksoy},
      year={2021},
      eprint={2106.13974},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
Check out our previous work
You can check out our previous work, SalsaNext!
Disclaimer
This repository is for research purposes only; use of this code is your own responsibility.