Investigate effect of not including start/end bigrams

Open hardbyte opened this issue 7 years ago • 1 comments

When creating the bi-grams, the word is padded with a whitespace character on each side, so the first and last bi-grams each contain a space.

This is a weakness, because it makes it easier for an attacker to find the beginning and the end of a word. Intuitively the padding helps with matching, so we should investigate whether dropping it decreases matching accuracy.
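To make the trade-off concrete, here is a minimal sketch of padded versus unpadded bi-gram generation, plus a Dice comparison on the raw token sets. The helper names `bigrams` and `dice` are made up for illustration and are not the library's actual tokenizer; the comparison is on plain token sets, not on Bloom filter encodings, so it only hints at the effect on real matching.

```python
def bigrams(word: str, pad: bool = True) -> list:
    """Return the character bi-grams of `word`.

    With pad=True a single space is added on each side first, so the
    first and last bi-grams mark the start and end of the word.
    (Hypothetical helper, not the project's real tokenizer.)
    """
    text = f" {word} " if pad else word
    return [text[i:i + 2] for i in range(len(text) - 1)]


def dice(a: set, b: set) -> float:
    """Sorensen-Dice coefficient of two token sets."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


print(bigrams("smith"))             # [' s', 'sm', 'mi', 'it', 'th', 'h ']
print(bigrams("smith", pad=False))  # ['sm', 'mi', 'it', 'th']

# Similarity of two spelling variants, with and without padding.
for pad in (True, False):
    score = dice(set(bigrams("smith", pad)), set(bigrams("smyth", pad)))
    print(f"pad={pad}: {score:.3f}")
# pad=True: 0.667
# pad=False: 0.500
```

In this one example the padded bi-grams give the higher score, because the two variants share their first and last letters. Whether that holds across a realistic dataset is exactly what needs measuring.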

Aha! Link: https://csiro.aha.io/features/ANONLINK-72

hardbyte • Jan 09 '18 02:01

From Cryptanalysis of Basic Bloom Filters paper:

[Image: screenshot from 2018-01-09 14-07-43]

hardbyte • Jan 09 '18 03:01