hep_ml icon indicating copy to clipboard operation
hep_ml copied to clipboard

Odd behaviour of GBReweighter

Open marthaisabelhilton opened this issue 5 years ago • 3 comments

I am trying to use GBReweighter and am getting odd behaviour of the weights as seen in the figures attached. My parameters are as follows: reweighter = GBReweighter(n_estimators=40, learning_rate=0.1, max_depth=3, min_samples_leaf=1000, gb_args={'subsample': 0.4}) I have tried varying the parameters. Do you know what might cause this behaviour? Many thanks. D02KSPiPiDD_2012_original.pdf D02KSPiPiDD_2012_reweighted.pdf

marthaisabelhilton avatar May 08 '19 15:05 marthaisabelhilton

Can it be so that you have

  1. too few samples in MC and 1000 elements in the leaf are almost completely coming from real data?
  2. or there are regions in variable space where you have almost no MC? (I blindly guess that second is true, by looking at mu_P at lower values)

arogozhnikov avatar May 09 '19 16:05 arogozhnikov

Hello and thank you for your reply.

  1. I've checked and I have ~400000 events in the training sample for both MC and data. Is this ok?
  2. I actually made cuts on the data to make sure there is no regions without MC (after I made these plots)

marthaisabelhilton avatar May 10 '19 13:05 marthaisabelhilton

  1. yes, that should be more than enough

arogozhnikov avatar May 10 '19 18:05 arogozhnikov