place_recognition_dataset_icra2018
                                
                                 place_recognition_dataset_icra2018 copied to clipboard
                                
                                    place_recognition_dataset_icra2018 copied to clipboard
                            
                            
                            
                        Outdoors datasets especially recorded for place recognition applications using both flying and hand-held setups.
V4RL Urban Place Recognition Dataset
This dataset provides outdoors sequences especially recorded for place recognition applications using both flying and hand-held setups. This dataset was made publicly available with the paper "Viewpoint-tolerant Place Recognition combining 2D and 3D information for UAV navigation", by Fabiola Maffra, Zetao Chen and Margarita Chli, published in the Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2018 [paper].
Video:
If you use this dataset, please cite the following publication:
@inproceedings{FMaffra:etal:ICRA2018,
    title     = {Viewpoint-tolerant Place Recognition combining 2D and 3D information for UAV navigation},
	author    = {Maffra, Fabiola and Chen, Zetao and Chli, Margarita},
	booktitle = {Proceedings of the {IEEE} International Conference on Robotics and Automation ({ICRA})},
	year      = {2018}
}
Shopping Street 1 & 2 Dataset
The Shopping Street sequences 1 and 2 were recorded using a hand-held setup with the camera facing perpendicular to the direction of motion (i.e. sideways) when walking down a busy shopping street with many pedestrians. Shopping Street 1 was recorded with the sensor held at eye-level and exhibits loops with small viewpoints changes, perceptual aliasing and changes in the scene appearance. Shopping Street 2 was recorded along the same street a few months later with the sensor mounted at the top of a 4m-long rod held vertically in order to capture a part of the street that is visible in Shopping Street 1, but from different viewpoints. Combining these two sequences a very challenging place recognition dataset is created, where the scene is not only revisited from very different viewpoints, but due to the large time interval between recordings, major changes in the appearance of the scene can be observed with most of the restaurants and shop windows in different configurations; e.g. shutters closed, window displays and even store logos changed. Moreover, parts of Shopping Street 2 exhibit large variance in illumination conditions, making it hard even for humans to detect that it is the same place visited in the first sequence.
Examples
Data
Shopping Street 1 Sequence 1 - Bagfile - Youtube
Shopping Street 1 Sequence 2 - Bagfile - Youtube
Shopping Street 2 - Bagfile - Youtube
Ground truth
The ground truth for each dataset was manually annotated using 2 different sequences, a query sequence and a reference sequence. The sequences used as reference and query for the available ground truth are:
- 
Shopping Street 1: - Reference Dataset: Shopping Street 1 sequence 1
- Query Dataset : Shopping Street 1 sequence 2
 
- 
Shopping Street 1 & 2: - Reference Dataset: Shopping Street 1 sequence 1
- Query Dataset : Shopping Street 2
 
More details about the ground truth are available on the links below.
Shopping Street 1 - Ground truth
Shopping Street 1 & 2 - Ground truth
UAV dataset
This sequence was recorded along a residential street using the Visual-Inertial sensor mounted on the bottom of an AscTec Neo UAV in a front-looking configuration, while performing lateral movements with the UAV in both directions. This sequence exhibits perceptual aliasing as well as large variance in viewpoints and difficult lighting conditions.
Examples
Data
UAV dataset Sequence 1 - Bagfile - Youtube
UAV dataset Sequence 2 - Bagfile - Youtube
Calibration
The images were captured using a VI-Sensor and calibrated using ETHZ ASL Kalibr. Below are the calibration parameters. Note that T_SC is the transformation from the Camera to the Sensor (IMU). The set of values correspond to camera0's intrinsics.
- {T_SC:     
    [ 0.9999921569165363, 0.003945890103835121, 0.0003406709575200133, -0.030976405894694664,        
     -0.003948017768440125, 0.9999711543561547, 0.0064887295612456805, 0.003944069243840622,         
     -0.00031505731688472255, -0.0064900236445916415, 0.9999788899431723, -0.016723945219020563,
     0.0, 0.0, 0.0, 1.0],
    image_dimension: [752, 480],
    distortion_coefficients: [0.0038403216668672986, 0.025065957244781098, -0.05227986912373674, 0.03635919730588422],
    distortion_type: equidistant,
    focal_length: [464.2604856754006, 463.0164764480498],
    principal_point: [372.2582270417875, 235.05442086962864]}
Contact
For any questions or bug reports, please create an issue or contact me at [email protected].





