GAMLET
Add similarity search over datasets in the OpenML database
We could add a similarity search that works as follows:
- Extract the necessary metafeatures from a custom dataset (implement the corresponding MetaFeaturesExtractor in #3 or in another PR)
- Load the full OpenML datasets database
- Find the N nearest datasets using DatasetsSimilarityAssessor
- Load all evaluations for the closest datasets from OpenML (implement the corresponding ModelsLoader in #3). Keep only the best M models for each dataset
- Provide a final report to the user
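
To make the proposed flow concrete, here is a minimal sketch of the nearest-dataset and best-model steps. It does not use the real MetaFeaturesExtractor, DatasetsSimilarityAssessor, or ModelsLoader interfaces, and it does not call OpenML. The function names `find_nearest_datasets` and `best_models`, the standardized Euclidean distance, the "higher score is better" convention, and all the data are assumptions chosen for illustration only.

```python
import numpy as np


def find_nearest_datasets(query, database, n):
    """Return ids of the n datasets closest to `query` (hypothetical helper).

    `database` maps a dataset id to its metafeature vector. Distance is
    Euclidean after per-feature standardization, which is an assumption,
    not necessarily what DatasetsSimilarityAssessor does.
    """
    ids = list(database)
    matrix = np.array([database[i] for i in ids], dtype=float)

    # Standardize each metafeature so large-valued ones do not dominate.
    mean = matrix.mean(axis=0)
    std = matrix.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant features

    scaled = (matrix - mean) / std
    scaled_query = (np.asarray(query, dtype=float) - mean) / std

    distances = np.linalg.norm(scaled - scaled_query, axis=1)
    nearest = np.argsort(distances)[:n]
    return [ids[i] for i in nearest]


def best_models(evaluations, dataset_ids, m):
    """Keep the m highest-scoring (model, score) pairs per dataset (hypothetical helper)."""
    return {
        dataset_id: sorted(
            evaluations.get(dataset_id, []),
            key=lambda entry: entry[1],
            reverse=True,
        )[:m]
        for dataset_id in dataset_ids
    }


# Toy metafeatures: (number of rows, number of columns, number of classes).
database = {
    "a": [100, 10, 2],
    "b": [120, 12, 2],
    "c": [50000, 300, 10],
    "d": [90, 8, 3],
}
# Toy evaluations standing in for results loaded from OpenML.
evaluations = {
    "a": [("rf", 0.91), ("knn", 0.85), ("svm", 0.88)],
    "b": [("rf", 0.80), ("logreg", 0.83)],
}

query = [105, 10, 2]  # metafeatures of the user's custom dataset
closest = find_nearest_datasets(query, database, n=2)
report = best_models(evaluations, closest, m=2)

for dataset_id, models in report.items():
    print(dataset_id, models)
```

The final loop stands in for the report step. A real report would presumably also include the distance to each dataset, so the user can judge how relevant the suggested models are.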