project-website Adds new blog post announcing opensearch hadoop


Description

Adds a new blog post announcing the availability of the OpenSearch Hadoop client.

Issues Resolved

[List any issues this PR will resolve]

Check List

[X] Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

Jun 02 '23 22:06 harshavamsi

This would be awesome to have!

Jun 06 '23 20:06 harshavamsi

This is a good start. I like having the compatibility matrices. Might be good, though, to also add a simple "Getting Started Examples"?

Maybe an example on how to write to a dataframe in scala?

e.g.,
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
    .master("local[*]")
    .config("opensearch.nodes", "127.0.0.1")
    .config("opensearch.net.http.auth.user", "admin")
    .config("opensearch.net.http.auth.pass", "admin")
    .config("opensearch.net.ssl", "true")
    .config("opensearch.batch.size.bytes", "1kb")
    .config("opensearch.net.ssl.cert.allow.self.signed", "true")
    .getOrCreate()
or how to use it with pyspark like I demonstrate in my comment on #153.

I'm happy to add if you'd like.

Go for it!

Jun 07 '23 20:06 wbeckler

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Jun 14 '23 20:06 hdhalter

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

Jun 15 '23 01:06 nknize

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

Hi Nick, this is still awaiting your input. Thank you!!

Jun 26 '23 15:06 wbeckler

@nknize @mnkugler @wbeckler - If you can make the final edits, update the blog date, and let @krisfreedain know when it's ready to go, we can get this posted to the blog tomorrow. Otherwise, we'll need to hold this until next Wednesday.

Jun 29 '23 02:06 pajuric

Otherwise, we'll need to hold this until next Wednesday.

Let's hold to Wednesday. I was working up the example with the published artifacts and noticed they don't support Spark 3. We may want to republish the Spark 3 artifacts before publishing the blog.

Jun 29 '23 02:06 nknize

@mnkugler and @wbeckler - Are we good to publish this today?

Jul 07 '23 15:07 pajuric

Still waiting on @nknize's changes.

Jul 07 '23 15:07 wbeckler

@pajuric The blocker right now is that the released OpenSearch-Hadoop artifacts are not compatible with Spark 3. Thus, the compatibility matrix in this blog post is not correct, and the example code I'm providing will not work for the users / readers running Spark 3:

e.g.,

[error] Modules were resolved with conflicting cross-version suffixes in ProjectRef(uri("file:/...
[error]    org.apache.spark:spark-core _2.13, _2.11

From example build.sbt

ThisBuild / scalaVersion := "2.13.0"

lazy val root = (project in file("."))
  .settings(
    name := "opensearch-spark-example"
  )

libraryDependencies ++= Seq(
  "org.apache.spark" %% "spark-core" % "3.2.4" exclude("javax", "servlet") exclude("org.apache", "hadoop"),
  "org.opensearch.client" % "opensearch-hadoop" % "1.0.1",
  "org.antlr" % "antlr4-runtime" % "4.8",
  "org.codehaus.janino" % "commons-compiler" % "3.0.8",
  "org.codehaus.janino" % "janino" % "3.0.8"
)

We need to publish the Spark 3-compatible version, which is built and packaged with the artifacts from the spark/sql-30 module.
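
For readers less familiar with sbt, the following is a minimal sketch of why the build above reports conflicting suffixes. It is a simplified model, not sbt's actual resolver: `Module`, `artifactName`, and `conflicts` are hypothetical names introduced only for illustration. The one sbt fact it relies on is that `%%` appends the Scala binary version (here `_2.13`) to the artifact name, while `%` leaves the name unchanged. The assumption that `spark-core_2.11` arrives transitively through `opensearch-hadoop` 1.0.1 is inferred from the error output and is not confirmed elsewhere in this thread.

```scala
// Simplified model of sbt cross-version naming; NOT sbt's real implementation.
final case class Module(org: String, name: String, crossVersioned: Boolean)

object CrossVersionSketch {
  // "2.13.0" -> "2.13": the Scala binary version used in artifact suffixes.
  def binaryVersion(scalaVersion: String): String =
    scalaVersion.split('.').take(2).mkString(".")

  // `%%` (crossVersioned = true) appends _<binaryVersion>; `%` keeps the plain name.
  def artifactName(m: Module, scalaVersion: String): String =
    if (m.crossVersioned) s"${m.name}_${binaryVersion(scalaVersion)}" else m.name

  // Split "spark-core_2.11" into ("spark-core", Some("_2.11")).
  def splitSuffix(artifact: String): (String, Option[String]) = {
    val idx = artifact.lastIndexOf('_')
    if (idx > 0 && artifact.substring(idx + 1).matches("""\d+\.\d+"""))
      (artifact.take(idx), Some(artifact.drop(idx)))
    else (artifact, None)
  }

  // Base names that were resolved with more than one Scala suffix.
  def conflicts(resolved: Seq[String]): Map[String, Set[String]] =
    resolved
      .map(splitSuffix)
      .collect { case (base, Some(suffix)) => base -> suffix }
      .groupBy(_._1)
      .map { case (base, pairs) => base -> pairs.map(_._2).toSet }
      .filter(_._2.size > 1)

  def main(args: Array[String]): Unit = {
    val sparkCore = Module("org.apache.spark", "spark-core", crossVersioned = true)
    // Direct dependency from build.sbt, declared with %% under Scala 2.13.0.
    val direct = artifactName(sparkCore, "2.13.0")
    // Assumption: pulled in transitively by opensearch-hadoop 1.0.1.
    val transitive = "spark-core_2.11"
    println(conflicts(Seq(direct, transitive)))
  }
}
```

In this model, the direct dependency resolves to `spark-core_2.13` while the assumed transitive one stays at `spark-core_2.11`, so the same base module appears with two suffixes, which is the condition the sbt error describes.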

Jul 07 '23 18:07 nknize

I opened an issue to move this forward: https://github.com/opensearch-project/opensearch-hadoop/issues/304

Jul 07 '23 18:07 nknize

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

Aug 21 '23 16:08 pajuric

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

@pajuric I've not heard from the authors in a while. I'm adding them to this comment, as they need to provide the update.

@nknize @harshavamsi Please update on the status of this blog. Is the text final and ready for an editorial review?

Aug 21 '23 16:08 vagimeli

@wbeckler @Xtansia - Please provide an update on the blog, as I understand it has been transferred over to you both.

Nov 02 '23 15:11 pajuric

@wbeckler - Are we OK to close this blog?

Jul 25 '24 15:07 pajuric
