ActionDetection-AFSD icon indicating copy to clipboard operation
ActionDetection-AFSD copied to clipboard

Data Pre-processing for untrimmed videos on non-standard data

Open shubhamagarwal92 opened this issue 2 years ago • 2 comments

Hi, Congratulations on such a nice work! Also, thank you for open-sourcing the code! We are trying to use this code on our raw untrimmed videos and want to use this framework for temporal action localization.

We have our own non-standard data with 15 minutes of videos on avg at 30fps and a higher resolution (~500X900). We also have multiple actions in the videos.

For the activity net, I see that the max frames are specified to be 768

Could you please suggest if we need to split video into clips and what would be the length of each clip? Do we need to sample 256/768 frames uniformly? Or should we split clips based on the actions? Could you please point to any starter code that we could refer?

Thanks.

shubhamagarwal92 avatar Aug 23 '21 12:08 shubhamagarwal92