LightGBM icon indicating copy to clipboard operation
LightGBM copied to clipboard

At which stage of the algorithm does GOSS operate?

Open CHENHUI-X opened this issue 2 years ago • 2 comments

Hi, I have a question about the GOSS algorithm. According to the content I saw in the paper and the integration information on the Internet, I learned that GOSS can reduce the amount of data through weight sampling. I want to know the execution order of the GOSS algorithm and the histogram algorithm : first use the GOSS algorithm to sample the dataset, and then divide the sample data to bins , right ? Or what order, please help me.

CHENHUI-X avatar Feb 25 '23 09:02 CHENHUI-X

Firstly, the data sample strategy is bagging by default.

So, GOSS algorithm won't work if you don't set core parameter data_sample_strategy = goss.

Secondly, constructing the histogram is after sampling the dataset.

AllenSun1024 avatar Apr 25 '23 10:04 AllenSun1024

Firstly, the data sample strategy is bagging by default.

So, GOSS algorithm won't work if you don't set core parameter data_sample_strategy = goss.

Secondly, constructing the histogram is after sampling the dataset.

Got it, thank you very much !

CHENHUI-X avatar Apr 25 '23 15:04 CHENHUI-X

This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

github-actions[bot] avatar Nov 01 '23 00:11 github-actions[bot]