ecosystem icon indicating copy to clipboard operation
ecosystem copied to clipboard

Does this connector work with TF 2.x?

Open dgoldenberg-audiomack opened this issue 4 years ago • 4 comments
trafficstars

The latest TF right now is 2.4.0. The latest connector on maven central is 1.15.0 published on Oct 23, 2019.

If I build the connector with the instructions from here: https://github.com/tensorflow/ecosystem/tree/master/spark/spark-tensorflow-connector, will it work?

dgoldenberg-audiomack avatar Dec 19 '20 23:12 dgoldenberg-audiomack

Hi @dgoldenberg-audiomack, when I build the connector in TF 2.0.0, I got fails in the build test stage ( link ), how about you?

Echo9573 avatar Jan 06 '21 01:01 Echo9573

Hey @Echo9573, I think your error is:

/bin/sh: 1: java: not found

judging by the output. However you can try running Maven with -e and -X to get more info. It seems like either you don't have Java installed or it's not on your PATH.

dgoldenberg-audiomack avatar Jan 06 '21 02:01 dgoldenberg-audiomack

I think the connector works; would be great if committers verified and added a blurb to the docs.

dgoldenberg-audiomack avatar Jan 06 '21 02:01 dgoldenberg-audiomack

I tried to follow the instructions with TF 2.2 and Spark 3.0.1, but installing ecosystem/hadoop has a missing dependency to org.tensorflow:proto:jar:2.2.0. The error message: Could not resolve dependencies for project org.tensorflow:tensorflow-hadoop:jar:2.2.0: Could not find artifact org.tensorflow:proto:jar:2.2.0 in central (https://repo.maven.apache.org/maven2)

Instructions pass if I manually set org.tensorflow:proto to TF version 1.15.0. However, I'm unsure what is the impact of using an old version of org.tensorflow:proto.

For the build I used dataproc preview-debian10 master image and these commands:

cd ../../hadoop
mvn versions:set -DnewVersion=2.2.0
mvn clean install
cd ../spark/spark-tensorflow-connector
mvn versions:set -DnewVersion=2.2.0
mvn clean install -Dspark.version=3.0.1

jukujala avatar Jan 13 '21 08:01 jukujala