benchmarks
Spark Standalone Mode - production
You benchmarked pandas vs. PySpark in standalone mode, and I got curious: can standalone mode be used in production?
How do you set the number of workers (slaves), the executor cores, the executor memory, and the number of executors?
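To make the question concrete, here is a minimal sketch of the settings I mean, assuming they map to the standard Spark properties `spark.executor.cores`, `spark.executor.memory`, and `spark.cores.max`. The helper names `standalone_conf` and `max_executors` and the example values are hypothetical, not taken from the benchmark. As far as I understand, standalone mode derives the executor count from the core settings rather than taking it directly, and the number of workers depends on how many worker daemons are started on the cluster, so neither appears as an explicit value here.

```python
def standalone_conf(executor_cores, executor_memory, total_cores):
    """Build Spark properties for a standalone cluster (hypothetical helper)."""
    if executor_cores <= 0 or total_cores < executor_cores:
        raise ValueError("total_cores must be >= executor_cores > 0")
    return {
        "spark.executor.cores": str(executor_cores),   # cores per executor
        "spark.executor.memory": executor_memory,      # heap per executor, e.g. "4g"
        "spark.cores.max": str(total_cores),           # cap on cores for this application
    }


def max_executors(conf):
    """Upper bound on executors implied by the core settings alone.

    Assumption: the real count can be lower if the workers do not have
    enough free cores or memory.
    """
    return int(conf["spark.cores.max"]) // int(conf["spark.executor.cores"])


# Example values are assumptions for illustration only.
conf = standalone_conf(executor_cores=2, executor_memory="4g", total_cores=8)
print(max_executors(conf))
```

Each key/value pair could then be passed to `SparkSession.builder.config(key, value)` when connecting to the standalone master. Is that roughly how the benchmark was configured?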
And if I run Spark on Docker, will it perform better, or is performance best with Spark on bare metal?