Investigate the discrepancy in default hyperparams compared to LightGBM

Open ogrisel opened this issue 6 years ago • 4 comments

Possible culprits:

  • shrinkage / learning_rate
  • min_samples_leaf
  • min_child_samples

See details in https://github.com/ogrisel/pygbm/issues/30#issuecomment-435091127.

Nov 01 '18 16:11 ogrisel

As @NicolasHug noted, the handling of min_samples_leaf in pygbm is not correct. I would rather implement what LightGBM does, that is, reject any split that would leave one of the child nodes with fewer than min_samples_leaf samples.
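
To make the proposed rule concrete, here is a minimal sketch of a histogram-based split search that skips any candidate leaving a child with fewer than min_samples_leaf samples. It is not pygbm's or LightGBM's actual code: the function name `find_best_split`, its arguments, and the simplified least-squares gain (unit hessians, no regularization) are assumptions made for illustration only.

```python
import numpy as np


def find_best_split(bin_counts, bin_gradient_sums, min_samples_leaf):
    """Hypothetical helper: scan the bins of one feature from left to right.

    Returns (best_bin, best_gain), where samples in bins <= best_bin go to
    the left child, or (None, 0.0) if no candidate split is allowed.
    """
    n_total = int(np.sum(bin_counts))
    g_total = float(np.sum(bin_gradient_sums))
    # A child can never be empty, even if min_samples_leaf is set to 0.
    min_samples = max(int(min_samples_leaf), 1)

    best_bin, best_gain = None, 0.0
    n_left, g_left = 0, 0.0
    for b in range(len(bin_counts) - 1):
        n_left += int(bin_counts[b])
        g_left += float(bin_gradient_sums[b])
        n_right = n_total - n_left

        # The rule discussed above: reject the split if either child
        # would end up with too few samples.
        if n_left < min_samples or n_right < min_samples:
            continue

        g_right = g_total - g_left
        # Simplified gain assuming a least-squares loss with unit hessians.
        gain = (g_left ** 2 / n_left
                + g_right ** 2 / n_right
                - g_total ** 2 / n_total)
        if gain > best_gain:
            best_bin, best_gain = b, gain
    return best_bin, best_gain
```

With this rule the constraint is enforced while candidate splits are evaluated, so a high-gain split that isolates a single sample is never selected in the first place.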

Nov 01 '18 17:11 ogrisel

You mean sklearn?

LightGBM is doing something very weird with min_sample_leaf: it looks like it is ignored because of num_leaves (see https://github.com/ogrisel/pygbm/issues/30#issuecomment-435138526).

Nov 01 '18 18:11 NicolasHug

@NicolasHug I think you used the wrong parameter name in that code.

Nov 02 '18 05:11 guolinke

Indeed. It's actually the pygbm handling of min_samples_leaf that is broken. See: #34.

Nov 02 '18 10:11 ogrisel