datumaro icon indicating copy to clipboard operation
datumaro copied to clipboard

Ability to keep the most significant video frames in dataset

Open zhiltsov-max opened this issue 3 years ago • 0 comments

A video can contain lots of frames, most of which do not bring any new information. A dataset created from a video can be significantly reduced without affecting model performance, if we keep only the most significant frames.

References:

  • BubbleNets: https://arxiv.org/pdf/1903.11779.pdf, https://github.com/griffbr/BubbleNets
  • Frames can be checked for similarity (similarly to near duplicate removal)

zhiltsov-max avatar Mar 04 '21 12:03 zhiltsov-max