xgboost_ray icon indicating copy to clipboard operation
xgboost_ray copied to clipboard

add support for multi-output prediction

Open yc2984 opened this issue 1 year ago • 1 comments

Currently, xgboost_ray doesn't support multi-output, neither the two options work.

  1. providing a list of labels: RayDMatrix(path, label=label_cols, filetype=RayFileType.PARQUET)
  2. providing a concrete data frame with multiple columns. RayDMatrix(data=df[feature_columns], label=df[target_columns])

However the second option is supported by the original xgboost package. Here is an issue tracking for future development for multi-output: https://github.com/dmlc/xgboost/issues/9043. A few distributed options are mentioned, but not ray, is there a plan to develop this feature soon for xgboost_ray as well? cc @Yard1

I also asked this in ray discussion forum: https://discuss.ray.io/t/does-xgboost-ray-supports-multi-output-many-y-labels/11383

yc2984 avatar Jul 14 '23 12:07 yc2984

cc @krfricke

Yard1 avatar Jul 14 '23 16:07 Yard1