snowplow-rdb-loader icon indicating copy to clipboard operation
snowplow-rdb-loader copied to clipboard

Common: consider deleting functionality related to S3-discovery

Open chuwy opened this issue 4 years ago • 2 comments

In #232 we moved entirely to SQS discovery, but left some functionality related to discovering data on S3, mostly in ShreddedType modeul. I think that it will be helpful later to check integrity of the batch (do periodical check for abandoned batches and notify operators about it) and decided to not delete yet.

chuwy avatar Jan 21 '21 18:01 chuwy

What do you mean by "checking integrity of the batch" ?

benjben avatar Jan 22 '21 08:01 benjben

That for example there are no unexpected folders, like:

run=2021-01-18-23-32-30/vendor=com.acme/name=expected/format=tsv/model=1
run=2021-01-18-23-32-30/something=unexpected_stuff

Or model=A.

chuwy avatar Jan 22 '21 13:01 chuwy