Spark Standalone Mode - production

Open akuardit opened this issue 7 years ago • 0 comments

You benchmarked pandas against PySpark in standalone mode, and that made me curious: can standalone mode be used in production?

How do you set the number of slave (worker) nodes, the executor cores, the executor memory, and the number of executors?
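To make the question concrete, here is a minimal sketch of the settings I mean, assuming a standalone cluster. As far as I understand, `spark.cores.max`, `spark.executor.cores`, and `spark.executor.memory` are real Spark properties, and in standalone mode the executor count is roughly `spark.cores.max` divided by `spark.executor.cores`, provided the workers have enough free cores and memory. The helper names (`standalone_conf`, `expected_executors`, `submit_command`), the master URL, the script name, and every number are hypothetical placeholders, not recommendations.

```python
# Sketch only: helper names, master URL, script name, and all numbers are
# hypothetical placeholders chosen for illustration.

def standalone_conf(total_cores, cores_per_executor, executor_memory):
    """Build the Spark properties that shape executors in standalone mode."""
    if cores_per_executor <= 0 or total_cores < cores_per_executor:
        raise ValueError("need 0 < cores_per_executor <= total_cores")
    return {
        "spark.cores.max": str(total_cores),              # cores the app may take cluster-wide
        "spark.executor.cores": str(cores_per_executor),  # cores per executor
        "spark.executor.memory": executor_memory,         # heap per executor, e.g. "4g"
    }


def expected_executors(conf):
    """Upper bound on executors; the workers' free resources can lower it."""
    return int(conf["spark.cores.max"]) // int(conf["spark.executor.cores"])


def submit_command(master_url, conf, app):
    """Assemble a spark-submit argument list from the properties above."""
    cmd = ["spark-submit", "--master", master_url]
    for key, value in conf.items():
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(app)
    return cmd


if __name__ == "__main__":
    conf = standalone_conf(total_cores=8, cores_per_executor=2, executor_memory="4g")
    print(expected_executors(conf))
    print(" ".join(submit_command("spark://master-host:7077", conf, "benchmark.py")))
```

The number of worker nodes is not part of this sketch, since it depends on how many worker daemons are started on the cluster rather than on anything the application sets.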

And if I run Spark on Docker, will it perform better, or is performance best with Spark on bare metal?

Nov 16 '18 06:11 akuardit