Sonal issues

Results 138 issues of


                                            Sonal

SQL based blocking and distance functions

What if we could take sql from say a dbt model or otherwise and use that for our model training - blocking as well as similarity? Then non Java programmers...

enhancement

Clean up code in Labeller and UpdateLabeller

Right now there is a lot of repeat code in the labeller and update labeller classes - for ex execute method could be in one place. Code is pretty hard...

technicalDebt

make the interactive learner color coded

It will improve readability quite a bit if - headers were bold - options yes/no etc were color coded

enhancement

Command line data stewardship

We can build a cli that can filter and show results to the user, much like the labeller

enhancement

Running training when dataset only has matches/non matches or limited samples throws errors. We should instead inform the user about this so they can add training samples.

Reported by Luke from Databricks [zingg_Dec21_0823_log4j-active (1).txt](https://github.com/zinggAI/zingg/files/7761163/zingg_Dec21_0823_log4j-active.1.txt) [zingg_Dec21_0823_sdtderr.txt](https://github.com/zinggAI/zingg/files/7761167/zingg_Dec21_0823_sdtderr.txt)

Sonal

SQL based blocking and distance functions

Clean up code in Labeller and UpdateLabeller

make the interactive learner color coded

Command line data stewardship

Running training when dataset only has matches/non matches or limited samples throws errors. We should instead inform the user about this so they can add training samples.

Error reporting framework

Online Arguments creator through JSON Editor?

the blocking algo needs to be thought through for descriptions and other kinds of data

do we need a fellegi sunter model as well?

Add case studies