zingg icon indicating copy to clipboard operation
zingg copied to clipboard

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Results 147 zingg issues
Sort by recently updated
recently updated
newest added

one frequent use case we encounter with customers is adding geospatial data types and matching within those. for example a point belongs to a building or a lot. we should...

We need to provide the right signals to the user in terms of how the model is converging and how it is performing on the training. right now it is...

**Is your **feature request** related to a problem? Please describe.** Marked recorded are stored as individual Parquet files. Parquet is an "immutable" binary format and difficult to edit and view...

**Is your feature request related to a problem? Please describe.** Zingg is a bit magical to me. Learning Spark and different strategies to cluster the entities is to be solved...

Matching can be greatly improved if we can segment and tag individual attributes, for example in address. check if we can use libpostal or build somethign similar which is generic....

Snowpark doesnt have ready to use ml and graph libs..investigate what we can build or use.

Many er use cases warrant real time - see how we can do that

Current logs do not print locations or names of pipes for training and test data etc making it difficult to understand things from user provided logs.

Databricks can load Maven jars directly, https://kb.databricks.com/libraries/maven-library-version-mgmt.html - so let us publish there and let users pick it up instead of manually copying

databricks