Samuel Wilson

Results: 48 comments by Samuel Wilson

I second this. This type of encoding is especially useful for linear modeling. It has an averaging effect on ordinal variables that is much more stable than simple one-hot...

This error means that all of the outputs from the tuning function have the same value. This causes singularity issues when trying to train a Gaussian Process.
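As a rough illustration of why identical outputs are a problem, here is a minimal sketch. It assumes the scores are standardized before the Gaussian Process is fit, which the comment above does not confirm. The function name `standardize` is hypothetical and not part of any library.

```python
import statistics


def standardize(scores):
    """Scale scores to zero mean and unit spread (hypothetical helper).

    Assumption: the optimizer rescales scores like this before fitting
    its Gaussian Process. With identical scores the spread is zero, so
    the rescaling is undefined and the fit cannot proceed.
    """
    spread = statistics.pstdev(scores)
    if spread == 0:
        raise ValueError(
            "all scores are identical; the scoring function may not "
            "depend on the tuned parameters"
        )
    center = statistics.mean(scores)
    return [(s - center) / spread for s in scores]
```

If a check like this fails, the first thing to look at is whether the scoring function actually uses the parameters being tuned, or whether the bounds are so narrow that every run lands on the same result.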

You would need to create multiple parameters that can be mapped from the scoring function to the vector. For instance, neuralnet has a parameter hidden which can be c(4,5) for...
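The mapping described above could look something like the following sketch. It is written in Python for illustration only; `score_network`, `hidden1`, `hidden2`, and `bounds` are hypothetical names, and the score is a placeholder rather than a real model fit.

```python
# Hypothetical search space: one scalar parameter per hidden layer.
bounds = {"hidden1": (1, 10), "hidden2": (1, 10)}


def score_network(hidden1, hidden2):
    """Scoring function that rebuilds the vector from scalar parameters."""
    # The optimizer only ever sees two scalars; the vector that the
    # model expects (the equivalent of c(4, 5)) is assembled here.
    hidden = (int(hidden1), int(hidden2))

    # Placeholder score, standing in for training and evaluating a
    # model with these layer sizes. It peaks at hidden == (4, 5).
    score = -abs(hidden[0] - 4) - abs(hidden[1] - 5)
    return {"hidden": hidden, "score": score}
```

The point of the design is that the vector never appears in the search space itself, so the optimizer can treat each layer size as an ordinary bounded parameter.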

Can you describe the general idea of the algorithm here?

The imputed values will still contain noise, since random forests inherently have random aspects to them in the training process. I do like the idea of adding noise based on...

Does running with `parallel = FALSE` fix the problem? Also, can you show me a summary of the dataframe, as well as the output of `sessionInfo()`?

This is probably caused by a hard-coded limit of 100 samples in the background set: https://github.com/slundberg/shap/issues/1991 The limit comes from this line: https://github.com/slundberg/shap/blob/b6e90c859fdfc6bc145242d9a8082d4ad844e995/shap/maskers/_tabular.py#L243

This might have been caused by not resetting the index on the data that was being imputed. There are assertions in major version 6 to keep these bugs from happening....

If you have a reproducible example where an unhelpful error is thrown, I can try to improve the error handling for this - but for now it's a tough problem...

Can you guys post a reproducible example? I'm not sure why this would happen.