Realtime_Multi-Person_Pose_Estimation icon indicating copy to clipboard operation
Realtime_Multi-Person_Pose_Estimation copied to clipboard

Data structure in lmdb

Open michalfaber opened this issue 6 years ago • 1 comments

I am trying to understand what is going on in the cpm_data_transformer. I've found that each image is stored in lmdb as a data structure: 6 x width x height - 3 channels, metadata, miss mask and all mask. Why is metadata encoded in such a big buffer? Is it for performance reasons? Did you try to generate augmented dataset with just images and labels (vec and heat) before training ?

michalfaber avatar Sep 13 '17 16:09 michalfaber

My wild guess is they where writing custom caffe layer and were limited in types of data layer could receive, so the encoded everything in additional image layer.

anatolix avatar Nov 13 '17 21:11 anatolix