Evgenii Ignatev comments

Results 5 comments of


                                            Evgenii Ignatev

"IsIn" pushdown for empty sequence generates invalid SQL

Created a PR to demonstrate my suggestion - https://github.com/snowflakedb/spark-snowflake/pull/252

"IsIn" pushdown for empty sequence generates invalid SQL

@sfc-gh-zli Hello, Actual code is quite large and interconnected, looks like can be outlined as roughly: `col_vals_set = set() # Set can be computed empty depending on the previous code.`...

"IsIn" pushdown for empty sequence generates invalid SQL

Also in my example empty set is used directly, not list.

What does ids.num-partitions do?

Also it is currently not clear how `cluster.max-partitions` and `ids.num-partitions` are correlated and this topic is not covered by docs.

Brainstorming functions to make PySpark easier

@MrPowers Small proposal - maybe adding UUID5 (not as complete as Python version obviously, but better than nothing) generator? - https://github.com/YevIgn/pyspark-uuid5/blob/2055a4aa8429424ef79c248f78aba2a33e462806/src/research_udf_performance.py#L158 - recently I made an attempt to write one,...