Realtime_Multi-Person_Pose_Estimation Data structure in lmdb

Data structure in lmdb

Open michalfaber opened this issue 6 years ago • 1 comments

I am trying to understand what is going on in the cpm_data_transformer. I've found that each image is stored in lmdb as a data structure: 6 x width x height - 3 channels, metadata, miss mask and all mask. Why is metadata encoded in such a big buffer? Is it for performance reasons? Did you try to generate augmented dataset with just images and labels (vec and heat) before training ?

Sep 13 '17 16:09 michalfaber

My wild guess is they where writing custom caffe layer and were limited in types of data layer could receive, so the encoded everything in additional image layer.

Nov 13 '17 21:11 anatolix

Realtime_Multi-Person_Pose_Estimation Realtime_Multi-Person_Pose_Estimation copied to clipboard

Data structure in lmdb

Realtime_Multi-Person_Pose_Estimation
Realtime_Multi-Person_Pose_Estimation copied to clipboard