clkhash
clkhash copied to clipboard
Understand restrictions in bloomfilter length
While reading Options for encoding names for data linking at the Australian Bureau of Statistics I came across this note regarding restrictions on the bloom filter's modulus:
In particular note:
m
must be prime.
Following the reference in Bloom filters in probabilistic verification is this note suggesting that a power of 2 is also an option:
cc: @wilko77
Aha! Link: https://csiro.aha.io/features/ANONLINK-48