Matt Ahrens
Matt Ahrens
Shouldn't impact our plugin code, can test to verify
In recent failure, the event log json is malformed which causes the profiler to fail: ```{"Event":"org.apache.spark.sql.execution.ui.SparkListenerSQLExecution{"Event":"org.apache.spark.sql.execution.ui.SparkListenerSQLE{"Event":"org.apache.spark.sql.execution.ui.SparkListenerSQLExecutionStart","executionId":2,"description":"src/main/python/conditionals_test.py::test_case_when[Float] (call)...``` It might be an issue with Databricks logging in that it's intermittent and...
For the recent tools job failures, I see these exceptions show up near the end of the previous pytests job (and the exceptions are not seen on days where all...
Is parallelism set to 4 for the pytests execution? Not sure how to interpret this command, but I see `-n 4`: ``` exec python /home/ubuntu/integration_tests/runtests.py --rootdir /home/ubuntu/integration_tests /home/ubuntu/integration_tests/src/main/python -n 4...
Could you estimate how much longer the pytests execution would take if n was set to 1 or 2, compared to how long it currently takes? That is likely what...
What specific types of applications were benchmarked? The estimates that we have done are the NDS benchmark @ SF3K and the accuracy ranges by query application but it generally within...
@jayadeep-jayaraman can you help review this?
Related to: https://github.com/NVIDIA/spark-rapids/issues/10741
Can you document what the datagen seed was for original failure and try to repro it? We want to keep this open for original failure with what the datagen seed...
Can you give more details on the test failures that necessitate this change?