IIM icon indicating copy to clipboard operation
IIM copied to clipboard

Training on general dataset

Open akshu281 opened this issue 1 year ago • 2 comments

Hello authors,

I am trying to put together a custom dataset which has both free view and surveillance view images. May I check the following with you.

  1. What's the general code to refer to for preparing data using scale detect approach as the one with traditional one is giving me continuous blobs for a few images if I try to run a common way of generating the maps for all of my images
  2. Also I understand the training process may take a longer time and resources to train. In my case my custom dataset is around 7K and the resolution varies from small to high. Would you advise which backbone or parameters I can borrow here to use in 2 GPUs with around 8-11 GB memory

Thank you in advance for the timely acknowledgment and response

akshu281 avatar Apr 11 '23 03:04 akshu281

Hi,

  1. The code in datasets/dataset_prepare could be helpful to generate the binary map for your dataset. If your dataset is without bounding box annotation or scale map. You can set a fixed box size to generate an instance map, which may loss some performance but can help to start the training.
  2. Considering your device, we suggest you use the VGG backbone in this repo.

taohan10200 avatar Apr 11 '23 05:04 taohan10200

Thank you for the prompt reply @taohan10200 ! My annotations are all point wise for the dataset. May I check if I can use the scale prediction network and get the maps? Is there any change that I should be making to the codes under datasets/dataset_prepare as I see designated codes for different dataset.

Also may I check if resizing module as part of dataset preparation is fixed to any size for any dataset we use?

akshu281 avatar Apr 11 '23 06:04 akshu281