nyaggle icon indicating copy to clipboard operation
nyaggle copied to clipboard

Starter notebook is in progress.

Open wakame1367 opened this issue 5 years ago • 2 comments

What content should be included? below link is draft. https://drive.google.com/open?id=1aSiplVhB9Hjcj8Ib-A9zkAjEgykpllH4

wakame1367 avatar Feb 21 '20 03:02 wakame1367

Oh, cool! Since nyaggle is a collection of independent utility functions, I think we don't need to visit all modules (this use case should rather be covered by the API reference).

If the notebook covers an end-to-end modeling process in typical tabular data competition, it would be very valuable for data-science beginners. In that case, for example, the content would be consist of the following steps:

  1. load train.csv, test.csv, sample_submission.csv
  2. manual feature engineering
  3. save/load features using feature_store (cached_feature decorator is suitable for a single notebook)
  4. (optional) check trian/test difference by adversarial_validation
  5. call run_experiment with mlflow
  6. see the result in the notebook and localhost:5000

The following notebook will be helpful as examples with similar purpose.

nyanp avatar Feb 21 '20 14:02 nyanp

Starter notebook moved. nyaggle-tutorial-for-titanic.ipynb

wakame1367 avatar Mar 06 '20 06:03 wakame1367