Matthew Powers
[The documentation](https://delta-io.github.io/delta-rs/python/usage.html#loading-a-delta-table) currently uses import statements like this: ```python from deltalake import DeltaTable from deltalake import DataCatalog ``` I usually see Python import statements like this: ```python import pandas as...
Will want to expose the "missing API functions" (e.g. regexp_extract_all) for the PySpark folks too. Think we'll be able to follow [this example](https://github.com/apache/spark/blob/master/python/pyspark/sql/functions.py#L1911-L1924) and expose these relatively easily. Something like...
Should be similar to [what's published for spark-daria](https://mrpowers.github.io/spark-daria/latest/api/#package).
These are functions that are not implemented in Spark, but commonly requested. They should be implemented as Catalyst Expressions so they're performant for the community: Datetime - [X] beginningOfDay -...
We can use this issue to create a list of all the functions that are in Spark SQL, but not in the Scala API for whatever reason. Here's the list...
I think we should bump to Spark 2.4.5 & see what features can get added to Spark 2. This way there'll be some JAR files for the Spark 2...
* ignore_nullable
* allow_nan_equality
* ignore_column_order
* ignore_row_order
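To make the intent of these flags concrete, here's a rough sketch of how three of them could behave. It uses plain lists of dict rows as a stand-in for DataFrames, and `rows_equal` is a hypothetical helper, not an existing API in any library. `ignore_nullable` is left out because plain dicts carry no schema.

```python
import math
from collections import Counter


def _normalize(value, allow_nan_equality):
    # NaN != NaN in Python, so optionally map every NaN to one shared sentinel.
    if allow_nan_equality and isinstance(value, float) and math.isnan(value):
        return "__nan__"
    return value


def rows_equal(actual, expected, ignore_row_order=False,
               ignore_column_order=False, allow_nan_equality=False):
    """Compare two lists of dict rows (hypothetical stand-in for DataFrames)."""
    def key(row):
        items = [(k, _normalize(v, allow_nan_equality)) for k, v in row.items()]
        # Sorting by column name makes column position irrelevant.
        return tuple(sorted(items)) if ignore_column_order else tuple(items)

    a = [key(r) for r in actual]
    e = [key(r) for r in expected]
    if ignore_row_order:
        # Multiset comparison: same rows, same counts, any order.
        return Counter(a) == Counter(e)
    return a == e
```

With this sketch, two inputs holding the same rows in a different row and column order only compare equal when both `ignore_row_order` and `ignore_column_order` are set. A real implementation would need to decide how these flags interact with the schema comparison.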
This project is set up for Poetry development. Is there a way to set up this project for conda development as well? I want to keep using Poetry for my personal development...
Maybe I'm the outlier, but I consider the more intuitive check -- especially for testing purposes -- to ignore order. If some function produces a DataFrame that I want to...