zingg
zingg copied to clipboard
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
I am trying to save the response from the html findTrainingData sampling exercise, which identified one matching pair but when i run the code to save samples as per this...
https://docs.zingg.ai/zingg0.4.0/stepbystep/runincremental
Hi, After the interactive labeler phase, when the below code runs : print(f'You have accumulated {n_pos} pairs labeled as positive matches.') print(f'You have accumulated {n_neg} pairs labeled as not matches.')...
Hi, I am trying to run the commands in FebrlExample.ipynb . But the command with ipymb widgets runs and just says Loading... I am using DBR 14.2 Please advise on...
"matchType" : "email" needs to be changed to fuzzy
In Data Sources and Sinks section, the AWS S3 part is empty
move `convertNamedListToStringList` in `FieldDefSelectedCols` to `Named` define interface method in inamed
Duplicate code like ``` for (int idx = 0; idx < sampleData.size(); idx++) { R row = rows.get(idx); for (Field column : fields) { String columnName = column.getName(); if (column.getType()...
check if in ClientOptions.java we get an exception if there is a null value - commented out code
I have two datasets: a "corpus" and a "query" database. I need to do active labeling only between those two datasets as the values themselves are already distinct for each...