Results 54 comments of Peixin

Unlike spark-rapids-jni, our cudf docker ENV currently has no boost dep: https://github.com/rapidsai/cudf/blob/branch-24.10/java/ci/Dockerfile.rocky. cc @mythrocks to help check, thanks

> Presumably the solution is to just add Boost to the JNI build container?
>
> Alternatively, the choice to always use a system boost was added by [@revans2](https://github.com/revans2) in...

merged workaround https://github.com/NVIDIA/spark-rapids/pull/11072 to unblock CI

According to https://github.com/numpy/numpy/issues/26191, the minimum required version of fastparquet should be `2024.5.0`. Let me try to update all occurrences of `fastparquet == 0.8.3` or `fastparquet==0.8.3` in the repo to `2024.5.0`...
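
A minimal sketch of that replacement, assuming a plain regex pass over file contents is enough. The helper name `bump_fastparquet` is hypothetical and not part of the repo; walking the repo's files and writing them back is left out.

```python
import re

# Matches both `fastparquet==0.8.3` and `fastparquet == 0.8.3`.
# The negative lookahead avoids touching longer versions such as 0.8.30 or 0.8.3.1.
OLD_PIN = re.compile(r"\bfastparquet(\s*)==(\s*)0\.8\.3(?!\.?\w)")


def bump_fastparquet(text: str, new_version: str = "2024.5.0") -> str:
    """Replace every fastparquet 0.8.3 pin in `text`, keeping the original spacing."""
    return OLD_PIN.sub(
        lambda m: f"fastparquet{m.group(1)}=={m.group(2)}{new_version}", text
    )
```

Keeping the spacing around `==` means each file retains its own pin style, so the resulting diff only shows the version change.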

BTW, numpy2 and fastparquet both **Requires: Python >=3.9**, while the current spark still has py3.8 support. Please let me know if we still need to test fastparquet in py3.8 ENV...

another repro in rapids_databricks_nightly-pre_release-github, run:701 (branch-25.04) `DATAGEN_SEED=1742879618`

```
FAILED ../../src/main/python/window_function_test.py::test_range_running_window_float_decimal_sum_runs_batched[1000][DATAGEN_SEED=1742879618, TZ=UTC, INJECT_OOM, IGNORE_ORDER({'local': True}), APPROXIMATE_FLOAT] - AssertionError: GPU and CPU float values are different [827, 'double_sum']
```

```
[2025-03-25T06:47:50.105Z] [2025-03-25T06:47:50.105Z]...
```
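
To collect the repro details from a CI log, the `FAILED` line can be parsed for the test file, test name, and seed. This is only a sketch: `parse_failure` is a hypothetical helper, and the pattern assumes the line layout shown in the failure above.

```python
import re

# Assumed layout: FAILED <path>::<test>[params][DATAGEN_SEED=<seed>, ...] - <error>
FAILED_LINE = re.compile(
    r"FAILED\s+(?P<path>\S+?)::(?P<test>\w+).*?DATAGEN_SEED=(?P<seed>\d+)"
)


def parse_failure(line: str):
    """Return (path, test_name, seed) for a FAILED summary line, or None if no match."""
    m = FAILED_LINE.search(line)
    if m is None:
        return None
    return m["path"], m["test"], int(m["seed"])
```

The extracted seed is the value to set as `DATAGEN_SEED` when rerunning the test to reproduce the same generated data.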

If the fix is our decision not to support pandas2 & numpy2, please also help include the fix to the `pip install` parts in:

- jenkins/databricks/setup.sh
- jenkins/Dockerfile-blossom.integration.ubuntu
- jenkins/Dockerfile-blossom.integration.rocky
- jenkins/Dockerfile-blossom.ubuntu
- integration_tests/README.md (update the `Dependencies` section)

and...

@amahussein Thanks for taking care of this!

> Do you still want to support Python 3.8?
> What is the highest python version. is it Python 3.11? Consider that pandas/numpy...

Closing with https://github.com/NVIDIA/spark-rapids/pull/11138. Please let me know if we need any other fix, thanks

also cc @res-life to help; this case is still unstable in non-UTC environments