autogluon-benchmarking icon indicating copy to clipboard operation
autogluon-benchmarking copied to clipboard

Kaggle benchmarks

Open mglowacki100 opened this issue 4 years ago • 1 comments

Hi Nick

I've read yours https://arxiv.org/abs/2003.06505 I think that mentioning that for at least couple of kaggle competitions obtaining top-scores is not possible without using data-leaks would be more fair to automated ml and would add some value to paper. For example:

  1. https://www.kaggle.com/c/santander-customer-transaction-prediction: https://www.kaggle.com/c/santander-customer-transaction-prediction/discussion/85125
  2. https://www.kaggle.com/c/santander-value-prediction-challenge: https://www.kaggle.com/c/santander-value-prediction-challenge/discussion/63907
  3. https://www.kaggle.com/c/ieee-fraud-detection: https://www.kaggle.com/c/ieee-fraud-detection/discussion/111284 Detection and exploit such leaks is hard even for human.

Btw. do you have plans for paper release with updated benchmarks?

Anyways great work :)

mglowacki100 avatar Dec 03 '20 12:12 mglowacki100

Thanks for the info! We will try to incorporate it into the next version of the paper.

Regarding updated benchmarks, we are planning to have them sometime in March/April 2021 as we are currently working on v0.1 release.

[Update 2022]: This timeline was shifted and we are planning to have the new paper release in March/April 2022 (Accepted at JMLR, preparing camera ready).

Innixma avatar Dec 05 '20 20:12 Innixma