project-website icon indicating copy to clipboard operation
project-website copied to clipboard

Adds new blog post announcing opensearch hadoop

Open harshavamsi opened this issue 2 years ago • 15 comments
trafficstars

Description

Adds a new blog post announcing the availability of the hadoop client

Issues Resolved

[List any issues this PR will resolve]

Check List

  • [X] Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

harshavamsi avatar Jun 02 '23 22:06 harshavamsi

This would be awesome to have!

harshavamsi avatar Jun 06 '23 20:06 harshavamsi

This is a good start. I like having the compatibility matrices. Might be good, though, to also add a simple "Getting Started Examples"?

Maybe an example on how to write to a dataframe in scala?

e.g.,

val spark = SparkSession.builder().master("local[*]")
    .config("opensearch.nodes", "127.0.0.1").config("opensearch.net.http.auth.user", "admin").config("opensearch.net.http.auth.pass", "admin").config("opensearch.net.ssl", "true")
    .config("opensearch.batch.size.bytes", "1kb").config("opensearch.net.ssl.cert.allow.self.signed", "true")
    .getOrCreate()

or how to use it with pyspark like I demonstrate in my comment on #153.

I'm happy to add if you'd like.

Go for it!

wbeckler avatar Jun 07 '23 20:06 wbeckler

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

hdhalter avatar Jun 14 '23 20:06 hdhalter

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

nknize avatar Jun 15 '23 01:06 nknize

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

Hi Nick, this is still awaiting your input. Thank you!!

wbeckler avatar Jun 26 '23 15:06 wbeckler

@nknize @mnkugler @wbeckler - If you can make the final edits, update he blog date, and let @krisfreedain know when it's ready to go, we can get this posted to the blog tomorrow. Otherwise, we'll need to hold this until next Wednesday.

pajuric avatar Jun 29 '23 02:06 pajuric

Otherwise, we'll need to hold this until next Wednesday.

Let's hold to Wednesday. I was working up the example with the published artifacts and noticed they don't support Spark 3. We may want to republish the Spark 3 artifacts before publishing the blog.

nknize avatar Jun 29 '23 02:06 nknize

@mnkugler and @wbeckler - Are we good to publish this today?

pajuric avatar Jul 07 '23 15:07 pajuric

Still waiting on @nknize's changes.

wbeckler avatar Jul 07 '23 15:07 wbeckler

@pajuric The blocker right now is that the released OpenSearched-Hadoop artifacts are not compatible with Spark 3. Thus the compatibility matrix in this blog post is not correct and the example code I'm providing will not work for the users / readers running Spark 3:

e.g.,

[error] Modules were resolved with conflicting cross-version suffixes in ProjectRef(uri("file:/...
[error]    org.apache.spark:spark-core _2.13, _2.11

From example build.sbt

ThisBuild / scalaVersion := "2.13.0"

lazy val root = (project in file("."))
  .settings(
    name := "opensearch-spark-example"
  )

libraryDependencies ++= Seq(
  "org.apache.spark" %% "spark-core" % "3.2.4" exclude("javax", "servlet") exclude("org.apache", "hadoop"),
  "org.opensearch.client" % "opensearch-hadoop" % "1.0.1",
  "org.antlr" % "antlr4-runtime" % "4.8",
  "org.codehaus.janino" % "commons-compiler" % "3.0.8",
  "org.codehaus.janino" % "janino" % "3.0.8"
)

We need to publish the Spark 3 compatible version which is built and packaged with the artifacts from the spark/sql-30 module

nknize avatar Jul 07 '23 18:07 nknize

I opened an issue to move this forward: https://github.com/opensearch-project/opensearch-hadoop/issues/304

nknize avatar Jul 07 '23 18:07 nknize

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

pajuric avatar Aug 21 '23 16:08 pajuric

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

@pajuric I've not heard from the authors in a while. I'm adding them to this comment, as they need to provide the update.

@nknize @harshavamsi Please update on the status of this blog. Is the text final and ready for an editorial review?

vagimeli avatar Aug 21 '23 16:08 vagimeli

@wbeckler @Xtansia - Please provide an update on the blog, as I understand it has been transferred over to you both.

pajuric avatar Nov 02 '23 15:11 pajuric

@wbeckler - Are we OK to close this blog?

pajuric avatar Jul 25 '24 15:07 pajuric