Zhenya Warshavsky
Zhenya Warshavsky
For the data models, see the next comment. The "Step 2" model caused issues on SageMaker with a 1M-row input file. Both times, the issue occurred at the OS level rather than the kernel level. The...
model 1:
```
fields = [
    {'field': 'email', 'type': 'String'},
    {'field': 'phone_number', 'type': 'String'}
]
```
model 2:
```
fields = [
    {'field': 'full_name', 'type': 'Name'},
    {'field': 'full_address', 'type': 'Address'...
```
On deduper.partition()
Same issue. Would love an update.