datumaro icon indicating copy to clipboard operation
datumaro copied to clipboard

Add dataset quality checks

Open zhiltsov-max opened this issue 3 years ago • 1 comments

When there is a dataset, it is often important to be able to check its quality. There are few way to check it:

  • compare annotations against a specification. Find too small boxes, too large boxes, too close points etc.
  • compare annotations from several sources, match by distance
  • annotate a dataset with a model, compare annotations against the inference results

It would be nice to support these and other approaches to quality checking.

Currently, in 0.1.6, the approaches 2 and 3 are implemented as datum merge.

zhiltsov-max avatar Mar 04 '21 13:03 zhiltsov-max

Added dataset validation for classification, detection and segmentation tasks in #160, #227, #233

zhiltsov-max avatar May 27 '21 08:05 zhiltsov-max