knowyourdata
knowyourdata copied to clipboard
A tool to help researchers and product teams understand datasets with the goal of improving data quality, and mitigating fairness and bias issues.
looks like it can't make toast
I believe would be a great addition to the documentation if you add a proper citation format for the too. Is there a specific whitepaper? Should we reference People +...
Hello I am wondering if there is any starter task posted for this project?
Attention, the i_naturalist2017 dataset has the labels misaligned from the items to which they refer, therefore, in the current state, it is completely useless.
I have some remote sensing images, and it would be helpful to use this structure to insert some samples/classes to teach deep learning for our applications. Regards
Hi folks. Is there any plan to set Know Your Data locally and run it on custom datasets? I think if the users could have their datasets in a way...
Really nice demo! I'm curious how `exposure_quality` and `sharpness_score` are computed. Both seem somewhat non-trivial to measure and I'm not sure what the standard process is in ML (as opposed...
Hello there, when are we going to support custom dataset?
I don't see any textual data in the demo. Are there any plans to have text data?
Bounding boxes for the `kitti` dataset are vertically flipped. This is a bug with the TensorFlow datasets implementation. See: https://knowyourdata-tfds.withgoogle.google.com/#dataset=kitti&tab=ITEM&draw=kyd/kitti/objects_type,bbox,bbox&item=000079.png ![image](https://user-images.githubusercontent.com/1100749/118014222-f541fd00-b320-11eb-8b1a-f3c16df4b702.png) See the BBoxFeature documentation on TensorFlow datasets: https://www.tensorflow.org/datasets/api_docs/python/tfds/features/BBoxFeature This...