OpenWPM
Remove instance_id
I've never found the instance_id useful. It's still in the schemas, but I've noticed it isn't in any of the tables from my latest crawl. If that's intentional (and not a sign that something went wrong with my crawl), should we clean up the schema?
@englehardt, hello, I'd like to work on this issue. After going through the repo: does this issue require removing all statements like pa.field("instance_id", pa.uint32(), nullable=False) from the various field and response lists in parquet_schema.py?
Yes, you need to remove it from parquet_schema.py, and also from the S3 Aggregator.
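Since the field has to be removed from more than one place, one way to check that nothing is left behind is to scan the checkout for remaining mentions. The following is only a rough sketch, not part of OpenWPM: the helper name find_references and the choice to look only at .py files are assumptions made for illustration.

```python
from pathlib import Path
from typing import Iterator, Tuple


def find_references(
    root: Path, needle: str = "instance_id"
) -> Iterator[Tuple[Path, int, str]]:
    """Yield (file, line number, stripped line) for each line that mentions needle.

    Hypothetical helper: it walks every .py file under root and does a plain
    substring match, so it will also report comments and docstrings.
    """
    for path in sorted(root.rglob("*.py")):
        try:
            text = path.read_text(encoding="utf-8")
        except (OSError, UnicodeDecodeError):
            continue  # skip files that cannot be read as UTF-8 text
        for lineno, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                yield path, lineno, line.strip()


if __name__ == "__main__":
    # Run from the root of the repository checkout.
    for path, lineno, line in find_references(Path(".")):
        print(f"{path}:{lineno}: {line}")
```

Running this before and after the change should show the pa.field("instance_id", ...) entries disappearing from parquet_schema.py, and anything still listed afterwards (for example in the S3 Aggregator code) is a candidate for cleanup.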