xgboost_ray
xgboost_ray copied to clipboard
Add zero-copy DMatrix creation with Arrow
We are currently converting to Pandas before initialising the DMatrix. We should consider using Arrow instead to avoid unnecessary copies. XGBoost has Arrow support - https://github.com/dmlc/xgboost/pull/7512
Thanks for adding this! It looked like the changes were split over two PRs, just FYI here is the second https://github.com/dmlc/xgboost/pull/7283
And supporting polars dataframe for creating DMatrix in Python?