spark
spark copied to clipboard
[SPARK-52327][Core] Glob based provider for history server
What changes were proposed in this pull request?
A new provider for history server is added for consuming event logs from multiple directories based on a glob instead of a single base directory.
Why are the changes needed?
There are usecases for eg., multi tenancy where a logical separation for event logs is required in which case the history server would have to read logs from multiple directories.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Unit tests are added
Was this patch authored or co-authored using generative AI tooling?
No
+CC @thejdeep
@thejdeep @mridulm Requesting you to help with the review please.
++ @LuciferYang @dongjoon-hyun