splink
splink copied to clipboard
Error if input dataframes already have a column named `source_dataset`
You get:
The error: AnalysisException: Found duplicate column(s) when inserting into xxx: `source_dataset`
Need to validate column names at the start for linkage job