xgboost_ray
xgboost_ray copied to clipboard
add support for multi-output prediction
Currently, xgboost_ray doesn't support multi-output, neither the two options work.
- providing a list of labels:
RayDMatrix(path, label=label_cols, filetype=RayFileType.PARQUET)
- providing a concrete data frame with multiple columns.
RayDMatrix(data=df[feature_columns], label=df[target_columns])
However the second option is supported by the original xgboost package. Here is an issue tracking for future development for multi-output: https://github.com/dmlc/xgboost/issues/9043. A few distributed options are mentioned, but not ray, is there a plan to develop this feature soon for xgboost_ray as well? cc @Yard1
I also asked this in ray discussion forum: https://discuss.ray.io/t/does-xgboost-ray-supports-multi-output-many-y-labels/11383
cc @krfricke