OpenWPM icon indicating copy to clipboard operation
OpenWPM copied to clipboard

Remove instance_id

Open birdsarah opened this issue 5 years ago • 2 comments

I've never found the instance_id useful, but it's still in the schemas and I've noticed it isn't in any tables from my latest crawl. If that's intentional (or maybe something went wrong with my crawl), we should clean up the schema?

birdsarah avatar May 30 '20 07:05 birdsarah

@englehardt , Hello, I wanted to work on this issue. After going through the repo, does this issue require to remove all the statements like pa.field("instance_id", pa.uint32(), nullable=False) from different fields and responses in parquet_schema.py file ?

Prajwal7842 avatar Oct 06 '20 16:10 Prajwal7842

Yes, you need to remove it from the parquet_schema but also from the S3 Aggregator

vringar avatar Oct 06 '20 18:10 vringar