Anton Parkhomenko

Results 165 issues of Anton Parkhomenko

Migrated from https://github.com/snowplow/snowplow/issues/3445#issuecomment-333064293 Right now we're generating random UUID, which makes all natural duplicates synthetic. We should throw exception and abort shredding instead.

One of our users mistakenly sent a `com.snowplowanalytics.snowplow/contexts` payload and Loader failed with following error: ``` Data discovery error with following issues: JSONPath file [com.snowplowanalytics.snowplow/contexts_1.json] was not found ``` Although,...

We have a chance of race condition, breaking the load when two pipelines are involved. With current default behavior: 1. Two pipelines *Big* and *Small* are loading data to same...

We encountered a case, where user has deleted all data from a single table. During usual load process Loader always checks if `atomic` data is present and aborts if it...

Background: https://discourse.snowplowanalytics.com/t/rdbloader-postgresql-error/2059/5 We're downloading data to Loader's node anyway, so we can extract data on-fly.

From my experience, enriched data assumes that raw data was not just enriched, but also validated - we never add invalid contexts/unstruct events to final enriched event. Yet validation is...

Until R29, `--dry-run` would work as expected even without connecting to Redshift - because its execution didn't depend on `atomic.manifest` nor any other tables. However, because of #14 and #70...

Migrated from https://github.com/snowplow/snowplow/issues/2217

[S3Guard](https://hortonworks.com/blog/s3guard-amazon-s3-consistency/) is an experimental s3a-plugin that uses DynamoDB as an intermediate metadata store. Right now this is part of Hadoop Common 2.9 and should be considered unstable. Also I'm not...

Cannot promise this will be implemented, @acgray, but what would your implementation look like? Is it another setting in `aws.s3.buckets`?