André
André
Hi @haowang-ms89 , `com.amazonaws.services.sagemaker.sparksdk.transformation.deserializers.XGBoostCSVRowDeserializer$$anonfun$deserializeResponse$1.apply(XGBoostCSVRowDeserializer.scala:41)` This line indicates that the XGBoost CSV deserializer is failing to deserialize the response from the XGBoost model. That number (`0.88765...`) is apparently one of the...
Hi @haowang-ms89 , Huh, it's possible that they don't log failed requests. Thank you for that warning, though. I'll update this issue when I hear back from them.
@haowang-ms89 , Yes, it sure would. If you can post it, I'll try to reproduce the issue.
Hi @haowang-ms89 , Thanks! I could reproduce this. I've contacted the XGBoost developers and asked them to take a look at what's going wrong.
@haowang-ms89 , There's a bug in the XGBoost container with the `multi:softprob` objective. Multi-dimensional arrays aren't being serialized from the hosting container back to the client in the expected format....
@haowang-ms89 , Yeah, I think that's right. Hyperparameters are passed in to XGBoost just as documented on the XGBoost GitHub page: https://github.com/dmlc/xgboost/blob/master/doc/parameter.md
@haowang-ms89 , Do they show up in your CloudWatch logs for your XGBoost training job? If not, we won't be able to get them, but if so: streaming logs from...
Labeling this as a bug and keeping this open to track the new output format for XGBoost for multi-dimensional arrays.
Adding a +1 to this enhancement request. @deadeyegoodwin -- wondering if your team has had a chance to scope this out?