rportilla-databricks comments

Results 12 comments of rportilla-databricks

Support for structured streaming

Hi, thanks for the question! We do not yet support structured streaming, but do you have a use case you're interested in? For example, streaming AS OF joins? On...

Support for structured streaming

Thanks for the detailed explanation! We can help with this. Would you mind sending over your email so I can set up a meeting? On Tue, Jun 8, 2021...

Support for structured streaming

@Sonali-guleria, just for reference.

asofJoin extremely slow when result has 99 or more columns

Thanks for the note, @rzsquirrel. This is actually due to Catalyst not being enabled when there are 100+ columns, and it is a Spark-related effect. Putting the values in a...

asofJoin extremely slow when result has 99 or more columns

@rzsquirrel, as a workaround, can you try raising this parameter to a value higher than 100? `spark.sql.codegen.maxFields` This limit is why codegen fails to be enabled - I'm wondering...
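For reference, a minimal sketch of the workaround suggested above, assuming the setting is applied at the session level before running the join. The helper name `raise_codegen_max_fields` and the `headroom` parameter are hypothetical and only for illustration; the configuration key `spark.sql.codegen.maxFields` is the one named in the comment.

```python
# Baseline taken from the "higher than 100" threshold mentioned in the comment.
BASELINE_MAX_FIELDS = 100


def raise_codegen_max_fields(spark, n_result_columns, headroom=50):
    """Set spark.sql.codegen.maxFields above the expected result width.

    `spark` is any object exposing `conf.set(key, value)`, such as a
    SparkSession. `headroom` is an arbitrary illustrative margin.
    Returns the value that was set.
    """
    target = max(BASELINE_MAX_FIELDS, n_result_columns + headroom)
    # Spark configuration values are passed as strings.
    spark.conf.set("spark.sql.codegen.maxFields", str(target))
    return target


# Usage with a real session (requires pyspark):
#   from pyspark.sql import SparkSession
#   spark = SparkSession.builder.getOrCreate()
#   raise_codegen_max_fields(spark, n_result_columns=200)
```

Whether this actually restores performance for a wide asofJoin result would still need to be confirmed by timing the join before and after the change.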

asofJoin extremely slow when result has 99 or more columns

@rzsquirrel, coming back to this issue, can you provide the timings for your tests above? When joining 100 columns to 100 columns, I'm getting a 20s runtime for the...

Enhancement request: Apply Custom Time Series Functions

@BenLBurke, this is definitely an interesting request. Can you explain which functions you are looking for in particular? We ask because applyInPandas is the go-to method...

AsOfJoin with subset of partitioning columns on right side

Hi Martin, we will take a look at this in a meeting next week to prioritize. Thanks for sending!

tempo inside `foreachBatch`

@sim-san, we will be supporting streaming for select operations. The first function to support streaming will be asofJoin. Resample/interpolation will come later and are not scoped for the...

Remove partition columns from Z Order optimization in `io.py`

We should be able to remove partition columns. On Mon, Aug 15, 2022 at 11:41 AM Lorin Dawson ***@***.***> wrote: > In io.py we Z ORDER on partitionCols + optimizationCols...
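As an illustration of the change discussed above, here is a rough sketch of building a Z ORDER column list that excludes partition columns. This is not the code in `io.py`; the function names `zorder_columns` and `optimize_statement` are hypothetical, and the sketch only assumes Delta's `OPTIMIZE ... ZORDER BY (...)` SQL syntax.

```python
def zorder_columns(partition_cols, optimization_cols):
    """Return optimization columns with partition columns removed.

    Order is preserved and duplicates are dropped.
    """
    partition_set = set(partition_cols)
    seen = set()
    result = []
    for col in optimization_cols:
        if col not in partition_set and col not in seen:
            seen.add(col)
            result.append(col)
    return result


def optimize_statement(table_name, partition_cols, optimization_cols):
    """Build an OPTIMIZE statement, skipping ZORDER BY if no columns remain."""
    cols = zorder_columns(partition_cols, optimization_cols)
    if not cols:
        return f"OPTIMIZE {table_name}"
    return f"OPTIMIZE {table_name} ZORDER BY ({', '.join(cols)})"


# Example: the partition column "symbol" is dropped from the Z ORDER list.
statement = optimize_statement("trades", ["symbol"], ["symbol", "event_ts"])
```

Falling back to a plain `OPTIMIZE` when every candidate column is a partition column is a design choice of this sketch, not something stated in the thread.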