Skater Extending partial dependence for pyspark support

Extending partial dependence for pyspark support

Open pramitchoudhary opened this issue 7 years ago • 3 comments

The idea with the ticket is to explore the possibility of extending the support of PDP for MLLib models as well.

As In-Memory Model
- either using the existing implementation and extending the support spark DataFrames
- native spark implementation for a more scalable and performant implementation
As Deployed Model Depending on the complexity, there might be a need to split this task into sub tickets.
- May we start here, to get the idea flowing and then moving into In-Memory support. Just a thought. Will have to think more about it.

Thoughts ? @benvandyke @aikramer2

Jan 16 '18 03:01 pramitchoudhary

Also, not sure why 'Assignees' is not allowing to include you guys. Will figure it out.

Jan 16 '18 18:01 pramitchoudhary

@benvandyke any luck with this one?

Apr 11 '18 21:04 pramitchoudhary

@pramitchoudhary Haven't had a chance to look into this yet, I've been doing more ETL and pipelines than modeling lately.

Apr 17 '18 16:04 benvandyke