Updating to Spark 3.2.0

Open jordyperlee opened this issue 3 years ago • 16 comments

We are looking to update to Scala 2.13 (and eventually Scala 3), but that requires us to update Spark to 3.2.0. Is there any plan to update the Spark base image for the operator?

jordyperlee avatar Jan 25 '22 14:01 jordyperlee

Hello,

Any update on this subject?

mbaron2 avatar Feb 08 '22 09:02 mbaron2

We would also be interested in updating to 3.2.x.

afranzi avatar Feb 10 '22 15:02 afranzi

Hello, we would also like to use Spark 3.2. Any update on support for this version?

amalkhed avatar Feb 11 '22 05:02 amalkhed

Hey team, any plans to release an operator version that officially supports 3.2? I checked that it actually works with a custom image of Spark 3.2 and the latest operator image (3.1.1), but I have not tested it deeply.

josecsotomorales avatar Feb 14 '22 15:02 josecsotomorales

May we get an answer at some point?

cchantep avatar Mar 15 '22 12:03 cchantep

Hi folks! Can you please officially support Spark 3.2.0?

Felix-neko avatar Mar 23 '22 07:03 Felix-neko

+1

leandro-rouberte avatar Apr 08 '22 13:04 leandro-rouberte

Hi

Any news regarding this issue?

There are many questions here. Can you estimate when it will be supported? Thanks :)

sharonorman78 avatar Apr 11 '22 14:04 sharonorman78

I built an image based on spark-3.2.1-bin-hadoop3.2. Is that a problem?

allenhaozi avatar May 11 '22 08:05 allenhaozi

@allenhaozi does your image work? Can you share it?

Thanks! :)

mbaron2 avatar May 11 '22 12:05 mbaron2

@mbaron2

It currently works in my test environment. Because it's still in beta, I can email it to you if you want.

The main reference is the following items:

allenhaozi avatar May 12 '22 10:05 allenhaozi

Is there any news regarding official support for 3.2.0?

sergialonsaco avatar May 27 '22 11:05 sergialonsaco

Spark 3.3.0 was released, I have a custom image of 3.2.1 running with the operator without any issues, and will experiment with 3.3.0 this week.

josecsotomorales avatar Jun 21 '22 01:06 josecsotomorales

In most cases, using a newer version of Spark goes smoothly. The main difference is typically in some configuration parameters that are passed to the Spark driver, which you can also set in the spec if necessary. So folks, just build your own image and give it a try.
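A minimal sketch of what that might look like, assuming the operator's `sparkoperator.k8s.io/v1beta2` `SparkApplication` resource. The image name, application name, namespace, service account, jar path, and the `sparkConf` entry are placeholders chosen for illustration, not values taken from this thread. The script only builds the manifest and prints it as JSON, which `kubectl apply -f` also accepts.

```python
import json


def build_spark_application(name, image, spark_version, main_file, spark_conf=None):
    """Return a SparkApplication manifest as a plain dict.

    Field names follow the operator's v1beta2 spec; every value passed in
    below is an illustrative assumption.
    """
    return {
        "apiVersion": "sparkoperator.k8s.io/v1beta2",
        "kind": "SparkApplication",
        "metadata": {"name": name, "namespace": "default"},
        "spec": {
            "type": "Scala",
            "mode": "cluster",
            # Custom image built from a newer Spark distribution.
            "image": image,
            "sparkVersion": spark_version,
            "mainClass": "org.apache.spark.examples.SparkPi",
            "mainApplicationFile": main_file,
            # Extra configuration passed through to Spark; values are strings.
            "sparkConf": dict(spark_conf or {}),
            "driver": {"cores": 1, "memory": "512m", "serviceAccount": "spark"},
            "executor": {"cores": 1, "instances": 1, "memory": "512m"},
        },
    }


manifest = build_spark_application(
    name="spark-pi-custom",  # hypothetical application name
    image="registry.example.com/spark:3.2.1",  # hypothetical custom image
    spark_version="3.2.1",
    # Assumed jar location inside the custom image.
    main_file="local:///opt/spark/examples/jars/spark-examples_2.12-3.2.1.jar",
    spark_conf={"spark.kubernetes.allocation.batch.size": "5"},
)

print(json.dumps(manifest, indent=2))
```

Saving the output to a file and applying it with `kubectl apply -f` would be the usual next step. If a newer Spark release needs different driver parameters, `sparkConf` is where they would go.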

vburenin avatar Jul 13 '22 14:07 vburenin

Hi @vburenin, I built a Spark 3.2.1 image which works fine, but in some cases the Spark application shows this trace:

22/08/18 11:14:17 WARN WatcherWebSocketListener: Exec Failure
java.io.EOFException
    at okio.RealBufferedSource.require(RealBufferedSource.java:61)
    at okio.RealBufferedSource.readByte(RealBufferedSource.java:74)
    at okhttp3.internal.ws.WebSocketReader.readHeader(WebSocketReader.java:117)
    at okhttp3.internal.ws.WebSocketReader.processNextFrame(WebSocketReader.java:101)
    at okhttp3.internal.ws.RealWebSocket.loopReader(RealWebSocket.java:274)
    at okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:214)
    at okhttp3.RealCall$AsyncCall.execute(RealCall.java:203)
    at okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
22/08/18 11:14:18 WARN ExecutorPodsWatchSnapshotSource: Kubernetes client has been closed (this is expected if the application is shutting down.)
io.fabric8.kubernetes.client.WatcherException: too old resource version: 12998019 (13034876)

and hangs for hours without failing.

Could this be somehow related to the image version? Thx

wolliq avatar Aug 24 '22 06:08 wolliq

> Spark 3.3.0 was released, I have a custom image of 3.2.1 running with the operator without any issues, and will experiment with 3.3.0 this week.

Hi @josecsotomorales, can you share the Dockerfile for this?

rahulbhatia2702 avatar Sep 19 '22 18:09 rahulbhatia2702

I would also be interested in this!

sousaaguilherme avatar Oct 25 '22 11:10 sousaaguilherme

I built it myself for testing: allenhaozi/base-pyspark-3.2.1-py-v3.8:v0.1.0

allenhaozi avatar Nov 04 '22 11:11 allenhaozi

If anybody is curious, it all works just fine if your target execution image is Spark 3.3.

vburenin avatar Nov 04 '22 15:11 vburenin

I'm actually using the operator with Spark 3.3 and a custom image. See this issue: https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/issues/1559

josecsotomorales avatar Nov 14 '22 13:11 josecsotomorales