blocklib icon indicating copy to clipboard operation
blocklib copied to clipboard

Python implementations of record linkage blocking techniques.

Results 28 blocklib issues
Sort by recently updated
recently updated
newest added

In the following code: ``` from blocklib import generate_candidate_blocks block_obj_alice = generate_candidate_blocks(alice['data'], blocking_config, header=alice['columns']) block_obj_alice.print_summary_statistics() ``` By executing the following line: ```block_obj_alice.print_summary_statistics()``` I got the error ``` AttributeError: 'CandidateBlockingResult' object...

The next version of the blocking schema should be consistent with use of under_scores, camelCase and hyphenated-keys..

Before computing similarity score computation and matching, we need to check if the number of encodings in blocking data is consistent with number of encodings in CLK data. Currently we...

Most of the [docs](https://blocklib.readthedocs.io/en/latest/blocking-schema.html#signaturespecs-configurations) don't show the newer/preferred way of defining a blocking schema with explicit column names. The functionality is mentioned in the tutorial, along with a single example:...

When I want to asses blocking result using metrics rr $ pc , I got the message error "ZeroDivisionError: float division by zero". How could I fix it?

Consider creating a python api as an alternative to creating signatures with JSON. At the moment we conflate signature specifications and data together a bit - e.g. `generate_by_char_at(attr_ind: int, dtuple:...

Possible extra strategies and set of names for signature generation: ## Existing strategies - ExactCharMatchSig: The letter at given index. (implemented as `generate_by_char_at`) - ExactMatchSig: the value of the whole...

>I don't think that filtering rare signatures is a good idea. I though we get the privacy protection by inserting the signatures into a Bloom filter. By carefully restricting the...

Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.0.3 to 2.0.6. Release notes Sourced from urllib3's releases. 2.0.6 Added the Cookie header to the list of headers to strip from requests when redirecting to a...

dependencies
python