Add scala notebook examples

Open kschelonka opened this issue 5 years ago • 7 comments

Ports two of the basic Python notebook examples to Scala using Zeppelin notebooks, for https://github.com/Azure/mmlspark/issues/166 and https://github.com/Azure/mmlspark/issues/453. GitHub doesn't have a native viewer for Zeppelin, so I included instructions in the readme on how to view the files using the Zepl viewer. They can also be downloaded and loaded into Zeppelin to run.

Quick links: Classification - Adult Census; Regression with L-BFGS

kschelonka avatar May 30 '19 06:05 kschelonka

CLA assistant check
All CLA requirements met.

msftclas avatar May 30 '19 06:05 msftclas

Thank you very much for this contribution @kschelonka! As a heads up, we will be moving to a newer and simpler build system in the next few days. Once I switch there, I can work on making these notebooks end-to-end tests on Databricks, like our other notebooks are. If you can move these to work on Databricks, with the assumption that the proper version of mmlspark is installed, the work can go a lot faster. We really appreciate your help and look forward to getting these in!

mhamilton723 avatar May 31 '19 15:05 mhamilton723

Thanks for the comment. I've never used Databricks before, but from reading the documentation it seems like the cluster can execute Scala files, so I can convert these examples to Scala programs.

kschelonka avatar Jun 03 '19 19:06 kschelonka

hey @kschelonka the build is now freshly refactored and we can begin building the infra to support these tests. I believe it should be a matter of adding the proper settings in the src/it/.../DatabricksUtilities.scala file.

mhamilton723 avatar Jul 06 '19 03:07 mhamilton723

Thanks, I'll take a look!

kschelonka avatar Jul 12 '19 14:07 kschelonka

Sorry for forgetting about this! I'm happy to pick it back up and integrate the updates. @mhamilton723 is the desired format still a Scala file that will be executed on Databricks?

kschelonka avatar May 13 '20 22:05 kschelonka

@kschelonka Sorry for the delayed response. Are you still the owner of this PR and willing to make the required changes? If not, can you tag the person who will own it? We would like to coordinate on completing the pull request.

KeerthiYandaOS avatar Jan 31 '22 19:01 KeerthiYandaOS