geni icon indicating copy to clipboard operation
geni copied to clipboard

A Clojure dataframe library that runs on Spark

Results 14 geni issues
Sort by recently updated
recently updated
newest added

Adds support for [Delta Lake](https://delta.io/) storage format. This PR is best reviewed 1 commit at a time. Currently, some core functionality of Delta is disabled due to a [known incompatibility...

- [x] I have read through the quick start and installation sections of the [README](../README.md). ## Info | Info | Value | | --- | --- | | Operating System...

- [X] I have read through the quick start and installation sections of the [README](../README.md). ## Info | Info | Value | | --- | --- | | Operating System...

[Delta Lake](https://delta.io/) brings a lot of crucial features into the Spark ecosystem. Some of the highlights include: 1. ACID transactions. 2. Time travel between data versions. 3. Safe in-place updates,...

There is a pre-release for Spark NLP on Spark 3.0.0 This might it possible to use it with Geni ? https://github.com/JohnSnowLabs/spark-nlp/releases/tag/3.0.0-rc8

Is it feasible to use a sparkml java package like https://graphframes.github.io/graphframes/docs/_site/index.html with Geni? Or is this outside of the scope?

- [x] I have read through the quick start and installation sections of the [README](../README.md). ## Info | Info | Value | | --- | --- | | Operating System...

Creating histograms is a very common activity. Geni offers [`cut`](https://github.com/zero-one-group/geni/blob/develop/src/clojure/zero_one/geni/core/foreign_idioms.clj#L158) which supports the creation of histograms as a function of `bins`, an array of values, but the user has to...

Geni users would benefit from support for Spark User Defined Functions on dataframes as documented [here](https://spark.apache.org/docs/latest/sql-ref-functions-udf-scalar.html). UDFs are very useful for data analysis from the simple classification of continuous values...

Following the minikube guide: https://github.com/zero-one-group/geni/blame/develop/docs/kubernetes_basic.md the verification of line 118 fails. It seems that I cannot change the spark-session, by calling `g/create-spark-session` I am pretty sure, that it worked at...