OpenWPM
OpenWPM copied to clipboard
A web privacy measurement framework
https://github.com/openwpm/OpenWPM/blob/abf10d745a42f6781a1b6116976a613900f73977/docs/Configuration.md?plain=1#L162-L174 The statement: ``` - `collection_fingerprinting` is the default if `js_instrument` is `True`. ``` Is false, as it is no longer the default. I found this a bit confusing while...
Geckodriver now supports full page screenshots (https://github.com/SeleniumHQ/selenium/pull/7182) So https://github.com/mozilla/OpenWPM/blob/aa5e2a6fe628e67953016f79fddd6e9c9096ac7f/automation/CommandSequence.py#L106-L130 can be updated to work with moden geckodriver. (originally added by pr https://github.com/mozilla/OpenWPM/pull/156)
There are a couple of places that document the limitations of geckodriver 0.15. As we're now on 0.26, we should check these references and update them or update the code...
I've never found the instance_id useful, but it's still in the schemas and I've noticed it isn't in any tables from my latest crawl. If that's intentional (or maybe something...
Conda says it can't satisfy the condition that glibc >=2.17 but the installed glibc is on version 2.26 as per the user-agent field in conda info. @birdsarah any idea why...
Otel
No intention to ever ship this unless someone else requests it, but I wanted to play around with it to better understand where OpenWPM is spending time and where it...
Helloo, I was involved in a project couple of years ago where we analyzed redirect chains. Back then, we could use old_channel_id and new_channel_id in order to track the redirect...
Hi, i'm running large crawls, but as i noticed temp files are not getting removed as sometime passes or crawls move forward openwpm_profile_archive_{some random number} --> each almost more than...
Fixes #1020 WIP. I haven't tested this much.