Brian Thorne
Brian Thorne
The PrivateSum project in examples/ folder is not compatible with java 1.7 and just works with version 1.8. - [ ] ensure the ci builds the example projects - [...
@RacingTadpole has carried out an independent code review of clkhash. ### Summary > No major issues identified. The code looks well structured. The two most important issues would be clearing...
In **Encoding hierarchical classification codes for Privacy-preserving Record Linkage using Bloom filters** _Rainer Schnell_ and _Christian Borgs_ introduce encoding Hierarchical classification codes into bloom filters: > > Hierarchical classification codes...
Pypy3 runs the unit tests in ~1m 41s versus Python 3.6 taking ~23s. Example build on [azure devops](https://dev.azure.com/data61/Anonlink/_build/results?buildId=1802&view=results). This issue is to identify why it is slower, and ideally solve...
Ideas: - version of clkhash used - size and statistics of clks - schema (or hash of schema) - hash of clks - timestamp (when the PII was encoded)
While reading [Options for encoding names for data linking at the Australian Bureau of Statistics](https://arxiv.org/abs/1802.07975) I came across this note regarding restrictions on the bloom filter's modulus: ![screenshot from 2018-02-24...
Consider if the right levels of abstraction have been made for a library user and document options to improve. It should be relatively easy for a clkhash user to define...
An experimental api has been added for uploading CLKs as a binary file. This is to allow for faster and more efficient data transfer. The same rest endpoint (`/projects/{project_id}/clks`) is...
Add a page to the docs with information about supported platforms including any special instructions on how to install dependencies e.g. Visual Studio C++ compiler on Windows. Perhaps worth looking...
We should rethink defaults as currently: * `clkhash` ignores the values in the spec * the defaults are spread throughout the code base. Either hard-coded (e.g. schema.py line 184), default...