Question about the performance of PySpark model serving
One concern with PySpark model serving is real-time performance, i.e. latency.
Clipper provides a wrapper around the PySpark session, as described in the documentation:
The model container creates a long-lived SparkSession when it is first initialized and uses that to load this model once at initialization time. The long-lived SparkSession and loaded model are provided by the container as arguments to the prediction function each time the model container receives a new prediction request.
However, the long-lived SparkSession seems heavyweight because of the communication between the Spark master and workers, even on a single machine. Do you have any results on the latency of Spark model serving?
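For reference, the initialize-once / reuse-per-request pattern that the quoted documentation describes can be sketched in plain Python. This is only an illustration of the pattern, not Clipper's actual container code; `ModelContainer`, `_start_session`, and the dict standing in for the SparkSession are all hypothetical names:

```python
import time


class ModelContainer:
    """Illustrative sketch of the long-lived-session pattern.

    The dict returned by _start_session stands in for a SparkSession:
    expensive to create, cheap to reuse. All names here are hypothetical,
    not Clipper's real classes.
    """

    def __init__(self, load_model):
        # Both steps run once, at container start-up (the expensive part).
        self.session = self._start_session()
        self.model = load_model(self.session)

    def _start_session(self):
        # Placeholder for something like SparkSession.builder.getOrCreate().
        return {"started_at": time.time()}

    def predict(self, inputs):
        # On every request, the same session and model are handed to the
        # prediction function; neither is re-created per request.
        return [self.model(self.session, x) for x in inputs]
```

The remaining latency question is then about the per-request cost inside `predict`, since session creation is amortized away.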
Thanks!
@simon-mo @withsmilo Any thoughts?
MLflow uses https://github.com/combust/mleap for low-latency SparkML serving. It could be a good fit for us.
@withsmilo I will take a look at it!