coddderX

Results 5 comments of coddderX

This may be because the validation dataset is loaded in memory on every executor. Therefore, a large validation dataset may cause out of memory errors

From the source code perspective, validateData is broadcasted to each executor without any compression processing, which leads to a significant consumption of memory. Therefore, currently, the only way to avoid...

![Image](https://github.com/user-attachments/assets/2994b306-9470-4493-9777-2cd5db9d146e)

![Image](https://github.com/user-attachments/assets/27505aa9-7bb3-4e7b-9850-93f0482e3c7f) This may be because the LightGBM C++ library does not support streaming validation datasets, but does support streaming training datasets. Therefore, I think the room for improvement in SynapseML...