fingerprint-securedrop
A machine learning data analysis pipeline for analyzing website fingerprinting attacks and defenses.
We don't really know the base rate of SecureDrop usage: yay anonymity. However, this means that in order to see how many users an attacker would correctly and incorrectly flag...
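To make the base-rate point concrete, here is a small worked sketch. The population size, base rates, TPR, and FPR below are made-up illustrative numbers, not measurements from this project, and `flagged_users` is a hypothetical helper. It shows how the share of flagged users who really are SecureDrop users changes as the assumed base rate varies.

```python
def flagged_users(population, base_rate, tpr, fpr):
    """Expected true/false positives and precision for an assumed base rate."""
    users = population * base_rate          # people actually using SecureDrop
    non_users = population - users
    true_pos = users * tpr                  # users correctly flagged
    false_pos = non_users * fpr             # non-users incorrectly flagged
    precision = true_pos / (true_pos + false_pos)
    return true_pos, false_pos, precision


# Sweep over assumed base rates, since the real one is unknown.
for base_rate in (0.01, 0.001, 0.0001):
    tp, fp, precision = flagged_users(1_000_000, base_rate, tpr=0.90, fpr=0.01)
    print(f"base rate {base_rate}: {tp:.0f} correct, {fp:.0f} incorrect, "
          f"precision {precision:.3f}")
```

Reporting results as a sweep like this avoids committing to a single base rate that nobody can verify.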
Of Python libraries that come to mind, `python-gnupg` has the nicest docstrings in my opinion. Poke around https://github.com/isislovecruft/python-gnupg/blob/master/gnupg/. You learn: - All parameters you can pass. - The type of...
The "sorter" tells us the state of onionspace based on public directories of onion services. We should ensure that this is being executed every night (via cron) such that the...
Selenium's page load timeout function is highly unreliable. If it doesn't close down a connection within 5s of when it's supposed to, we should stop a crawl by whatever means...
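One possible way to enforce a hard deadline, sketched under assumptions: each page fetch runs in its own child process, and a supervisor kills the whole process group if the child outlives the page load timeout plus a grace period. The function name `run_with_hard_deadline` and the idea of one process per fetch are hypothetical, not how the crawler is currently structured, and the sketch is POSIX-only because it relies on `os.killpg`.

```python
import os
import signal
import subprocess
import sys


def run_with_hard_deadline(cmd, page_load_timeout, grace=5.0):
    """Run cmd; return True if it exits on its own, False if it had to be killed.

    The child gets its own session, so its process group id equals its pid
    and any processes it spawned (e.g. a browser) are killed along with it.
    """
    proc = subprocess.Popen(cmd, start_new_session=True)
    try:
        proc.wait(timeout=page_load_timeout + grace)
        return True
    except subprocess.TimeoutExpired:
        os.killpg(proc.pid, signal.SIGKILL)  # kill the whole group
        proc.wait()                          # reap the child
        return False


if __name__ == "__main__":
    # Stand-in for a fetch that hangs: sleeps far longer than the deadline.
    hung = [sys.executable, "-c", "import time; time.sleep(60)"]
    print(run_with_hard_deadline(hung, page_load_timeout=1.0, grace=1.0))
```

The trade-off is the cost of starting a fresh process per fetch, in exchange for a timeout that does not depend on Selenium cooperating.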
We have a very imbalanced machine learning problem, where we have far fewer SecureDrop users than non-SecureDrop users. There are many ways of handling this situation - including oversampling the...
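As an illustration of one of these options, here is a minimal sketch of random oversampling in plain Python: minority-class rows are duplicated at random until every class is as large as the biggest one. `oversample_minority` is a hypothetical helper, not existing project code. Oversampling should be applied only to the training portion of each fold, otherwise duplicated rows leak into the test set.

```python
import random
from collections import Counter


def oversample_minority(rows, labels, seed=0):
    """Duplicate random rows of smaller classes until all classes are equal in size."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for label, n in counts.items():
        pool = [r for r, l in zip(rows, labels) if l == label]
        extra = [rng.choice(pool) for _ in range(target - n)]
        out_rows.extend(extra)
        out_labels.extend([label] * len(extra))
    return out_rows, out_labels


# Toy data: 2 positive (SecureDrop) traces versus 8 negative ones.
rows = [[i] for i in range(10)]
labels = [1, 1] + [0] * 8
balanced_rows, balanced_labels = oversample_minority(rows, labels)
print(Counter(balanced_labels))
```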
Many of our features are not very useful. We should include a first step of feature selection before passing the features matrix to the classifier. This could be something simple,...
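One simple candidate for such a first step, offered as an assumption rather than as the method this issue has in mind, is to drop features whose variance across all traces is at or below a threshold, since constant columns cannot help a classifier. `select_by_variance` is a hypothetical helper.

```python
from statistics import pvariance


def select_by_variance(matrix, threshold=0.0):
    """Return (kept column indices, reduced matrix), dropping low-variance columns."""
    columns = list(zip(*matrix))  # transpose rows into columns
    keep = [i for i, col in enumerate(columns) if pvariance(col) > threshold]
    reduced = [[row[i] for i in keep] for row in matrix]
    return keep, reduced


# Toy features matrix: column 0 is constant and carries no information.
features = [[1, 0, 5],
            [1, 1, 7],
            [1, 0, 9]]
kept, reduced = select_by_variance(features)
print(kept, reduced)
```

The kept indices should be computed on training data only and then reused to slice the test data.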
Right now we generate a ROC curve that is averaged over all k-folds, but we don't save the FPR and TPR in the database in `models.undefended_frontpage_attacks`. We should do that...
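A rough sketch of what storing per-fold curves could look like. It uses an in-memory SQLite database as a stand-in for the project's database, and the `fold_roc` table, its columns, and both helper functions are hypothetical; the real `models.undefended_frontpage_attacks` schema may call for a different layout.

```python
import json
import sqlite3


def save_fold_roc(conn, attack_id, fold, fpr, tpr):
    """Store one fold's ROC points; the lists are serialized as JSON text."""
    conn.execute(
        "INSERT INTO fold_roc (attack_id, fold, fpr, tpr) VALUES (?, ?, ?, ?)",
        (attack_id, fold, json.dumps(fpr), json.dumps(tpr)),
    )


def load_fold_rocs(conn, attack_id):
    """Return [(fold, fpr_list, tpr_list), ...] ordered by fold."""
    rows = conn.execute(
        "SELECT fold, fpr, tpr FROM fold_roc WHERE attack_id = ? ORDER BY fold",
        (attack_id,),
    )
    return [(fold, json.loads(fpr), json.loads(tpr)) for fold, fpr, tpr in rows]


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE fold_roc (attack_id INTEGER, fold INTEGER, "
    "fpr TEXT, tpr TEXT, PRIMARY KEY (attack_id, fold))"
)
# Illustrative values only, not real results.
save_fold_roc(conn, 1, 0, [0.0, 0.1, 1.0], [0.0, 0.8, 1.0])
save_fold_roc(conn, 1, 1, [0.0, 0.2, 1.0], [0.0, 0.7, 1.0])
```

Keeping one row per fold means the averaged curve can always be recomputed later, while the per-fold spread stays available.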
Rewrite the burst generation currently done in `FeatureStorage.create_bursts()` (in `features.py`) in SQL, so that feature generation is fast and scalable.
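A sketch of how this might look, under a stated assumption: a burst is taken to be a maximal run of consecutive packets travelling in the same direction within one trace, which may not match what `create_bursts()` computes. The `packets` table, its columns, and `bursts()` are hypothetical, and SQLite (3.25 or later, for window functions) stands in for the real database. The query uses the "gaps and islands" pattern: the difference between two row numberings is constant within each run.

```python
import sqlite3

BURST_QUERY = """
WITH numbered AS (
    SELECT trace_id, ts, direction,
           ROW_NUMBER() OVER (PARTITION BY trace_id ORDER BY ts)
         - ROW_NUMBER() OVER (PARTITION BY trace_id, direction ORDER BY ts)
           AS island
    FROM packets
)
SELECT trace_id, direction, COUNT(*) AS length
FROM numbered
GROUP BY trace_id, direction, island
ORDER BY trace_id, MIN(ts)
"""


def bursts(conn):
    """Return [(trace_id, direction, burst_length), ...] in time order."""
    return conn.execute(BURST_QUERY).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE packets (trace_id INTEGER, ts REAL, direction INTEGER)")
# One toy trace: two outgoing (+1), three incoming (-1), one outgoing packet.
directions = [1, 1, -1, -1, -1, 1]
conn.executemany(
    "INSERT INTO packets VALUES (1, ?, ?)",
    [(i * 0.1, d) for i, d in enumerate(directions)],
)
print(bursts(conn))
```

Because the grouping happens inside the database, no per-packet rows need to be pulled into Python.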
http://xnsoeplvch4fhk3s.onion/ stalls the crawler indefinitely. The 20s page load timeout variable should kill the connection, but for some reason Selenium fails to do so with this site. Here's the Firefox...