coffee_boat issues

What about using pex instead?

1

https://github.com/pantsbuild/pex

holdenk

Add PySpark dep back in

1

For now we leave it out since many providers won't support it right now. Long story, buy me a :coffee: .

holdenk

Add support for local mode

holdenk

Support Spark on K8 better

Right now we do some terrible things with overriding the PYTHON_PATH, which is great and works in the general case. If the Spark+K8 folks end up integrating better first party...

holdenk

Enable pep8 tests

holdenk

Enable flake8 tests

holdenk

Update example to use arrow for vectorized UDF funtime

1

We currently have one example notebook, would be good to update the example to distribute PyArrow since this will be useful in Spark 2.3+ for vectorized UDF users.

holdenk

help wanted

good first issue

Investigate distribute venv cleanup

In theory most of what we do is with add files in Spark which should be handled, but the decompressed directory I'm less certain about. We should investigate this.

holdenk

Handle local cleanup better

Write now we create a bunch of temp files but don't really clean them up. There is a flag to do part of this but it needs to be tested...

holdenk

help wanted

Throw a clear error for no packages

holdenk

good first issue

coffee_boat
coffee_boat copied to clipboard

Metadata

What about using pex instead?

Add PySpark dep back in

Add support for local mode

Support Spark on K8 better

Enable pep8 tests

Enable flake8 tests

Update example to use arrow for vectorized UDF funtime

Investigate distribute venv cleanup

Handle local cleanup better

Throw a clear error for no packages

← Metadata

Owner

Metadata

coffee_boat coffee_boat copied to clipboard

Metadata

← Metadata

Owner

Metadata

coffee_boat
coffee_boat copied to clipboard