clipper icon indicating copy to clipboard operation
clipper copied to clipboard

Ask about the performance issue of pyspark model serving

Open parkerzf opened this issue 6 years ago • 3 comments

One concern for the pyspark model serving is the real time performance or latency.

Clipper provides a wrapper of pyspark session, as mentioned in the document:

The model container creates a long-lived SparkSession when it is first initialized and uses that to load this model once at initialization time. The long-lived SparkSession and loaded model are provided by the container as arguments to the prediction function each time the model container receives a new prediction request.

However, it seems that the long-lived SparkSession is heavy because of the communication between spark master and worker, even on the single machine. Do you have results about the latency of the spark model serving?

Thanks!

parkerzf avatar May 24 '18 05:05 parkerzf

@simon-mo @withsmilo Any thought?

rkooo567 avatar May 29 '19 03:05 rkooo567

MLFlow uses https://github.com/combust/mleap for low-latency SparkML serving. It seems to be nice for us.

withsmilo avatar May 29 '19 03:05 withsmilo

@withsmilo I will take a look at it!

rkooo567 avatar May 29 '19 04:05 rkooo567