SynapseML icon indicating copy to clipboard operation
SynapseML copied to clipboard

LightGBM on Apache Spark - support for scala 2.12

Open informatica92 opened this issue 5 years ago • 17 comments

Is there a reason why LightGBM on Apache Spark doesn't have a jar for scala 2.12? It would be nice to have support for both versions (2.11 and 2.12)

informatica92 avatar Aug 07 '20 14:08 informatica92

👋 Thanks for opening your first issue here! If you're reporting a 🐞 bug, please make sure you include steps to reproduce it.

welcome[bot] avatar Aug 07 '20 14:08 welcome[bot]

can i see your example?plz

shuDaoNan9 avatar Aug 08 '20 09:08 shuDaoNan9

Agreed, I've seen some conversations around this and I believe the latest was that there are some dependencies of mmlspark which don't yet support 2.11. Otherwise, if those are excluded somehow or resolved, I believe it is straightforward.

imatiach-msft avatar Aug 10 '20 04:08 imatiach-msft

Maybe a pre-lease supporting 2.12 (excluding incompatible dependencies) could be released? Maybe spark 3.0.0 support could also be added to that one?

brunocous avatar Aug 12 '20 14:08 brunocous

It would be great in my opinion... I had to downgrade Scala in order to make it work

informatica92 avatar Aug 13 '20 06:08 informatica92

We're holding up upgrading our Runtime framework, blocked on this upgrade, so this would be very helpful. Is anyone able to provide an ETA for when this will happen so we can plan around this better?

anupam-burra-zocdoc avatar Nov 16 '20 20:11 anupam-burra-zocdoc

related to #445 and #912 ?

juanpaulo avatar Jan 20 '21 07:01 juanpaulo

Looks like this (and related issues) is resolved by the recently merged PR: #970

juanpaulo avatar Feb 02 '21 04:02 juanpaulo

Does anyone know how to try with the nightly build?

nhymxu avatar Feb 08 '21 06:02 nhymxu

@nhymxu the nightly build is on github with a badge: image

imatiach-msft avatar Apr 21 '21 06:04 imatiach-msft

latest mmlspark support scala 2.12

imatiach-msft avatar Apr 21 '21 06:04 imatiach-msft

I have installed the latest MML Spark version(com.microsoft.ml.spark:mmlspark_2.11:1.0.0-rc3) on databricks with scala 2.12 & spark 3.0.1 but I am unable to run MMLSpark on it. While running, it gives error "java.lang.NoClassDefFoundError: org/apache/spark/ml/util/MLWritable$class". I believe this issue is because of the non-compatibility with Scala 2.12 or is there anything additional required to run it on Scala 2.12?

vinay-singh27 avatar Apr 29 '21 13:04 vinay-singh27

@vinay-singh27 just from this version:

com.microsoft.ml.spark:mmlspark_2.11:1.0.0-rc3

you must have installed the 2.11 scala version, not 2.12. RC3 release does not have 2.12 version, it's only available on master. However you can install any master build, for example: coordinate: com.microsoft.ml.spark:mmlspark_2.12:1.0.0-rc3-62-25d40cff-SNAPSHOT repository: https://mmlspark.azureedge.net/maven

you can see latest master in github main page

imatiach-msft avatar May 03 '21 04:05 imatiach-msft

Is there any chance we'll get the 2.12 version in Maven Central soon?

cosmincatalin avatar May 31 '21 12:05 cosmincatalin

@cosmincatalin we haven't been releasing to maven central for a while, we have always been releasing to our repository: https://mmlspark.azureedge.net/maven all of the rc1-rc3 releases have been there. Do you have any issues with using the repository instead of maven central? You should be able to specify the repository when getting spark packages on all clusters/platforms.

imatiach-msft avatar Jun 01 '21 03:06 imatiach-msft

That's exactly the problem I have, however, I think I have a fix for this, so we're good 😃

cosmincatalin avatar Jun 01 '21 08:06 cosmincatalin

Just to re-provide an answer for those looking for help

Our latest maven artifact is for 2.12 https://central.sonatype.com/artifact/com.microsoft.azure/synapseml_2.12/0.11.2

mhamilton723 avatar Jul 24 '23 15:07 mhamilton723