Haejoon Lee
- Is the change to python/pyspark/pandas/resample.py safe? It breaks the previous behavior, so if we plan to ship another minor release (Spark 3.6.0), this should not be included. - What...
We should not introduce any breaking changes. Let me address them. Thanks, @dongjoon-hyun, for double-checking.
Oh, wait. I just remembered that we simply follow the Pandas behavior and list the breaking changes separately in the [release notes](https://github.com/apache/spark/blob/master/python/docs/source/migration_guide/pyspark_upgrade.rst). ``` - In Spark 4.0, it is recommended to...
Just updated resample to work with old Pandas as well. I think we can mark it as deprecated for now to avoid breaking existing pipelines. (Also updated the...
Thank you so much all for the review!
Thanks, @HyukjinKwon, for reviewing. I just fixed the regressions from the past few PRs and updated the PR title & description accordingly.
ditto. Please see https://github.com/databricks/koalas/issues/2223#issuecomment-1789845928.
Looks fine for now, but in the future we might need to split this into more parts instead of just rebalancing if the number of tests increases.
Let's use the default PR template: ``` ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change?...
Oh, okay, it seems NumPy recently released a new major version (2024-06-17): [Release Note](https://github.com/numpy/numpy/releases/tag/v2.0.0). @HyukjinKwon Maybe we should raise the minimum supported NumPy version to 2.0.0, as we did for...