tempo
tempo copied to clipboard
Improve AS OF Join Performance through Max Lookback parameter
Currently, AS OF joins require a window to be retained in memory per partition key (key for the Spark window). Improve may benefit from only looking back a certain number of records (or using a range if appropriate). This ticket should have:
- Tests showing perf improvement
- Prototype with a lookback parameter option (records lookback or range)
- Feedback from a user to see whether this type of approximation is acceptable.
Would this be covered by the new tolerance
parameter added by @nina-hu in #304?