snowplow-rdb-loader
Stores Snowplow enriched events in Redshift, Snowflake and Databricks
In most places, Snowplow apps let the AWS SDK figure out the region using the default provider chain, but the AWS streaming transformer uses Hadoop for writing to S3 (only for Parquet...
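For context, a minimal sketch of the two lookup paths, assuming AWS SDK v2 and Hadoop's S3A connector. `DefaultAwsRegionProviderChain` and the `fs.s3a.endpoint.region` key are real; the surrounding wiring is illustrative, not the transformer's actual code:

```scala
import software.amazon.awssdk.regions.providers.DefaultAwsRegionProviderChain
import org.apache.hadoop.conf.Configuration

object RegionResolution {
  // How most Snowplow apps resolve the region: the SDK's default chain
  // checks env vars, system properties, the shared config file, and
  // finally the instance metadata endpoint.
  def sdkRegion(): String =
    new DefaultAwsRegionProviderChain().getRegion.id

  // Per the report above, Hadoop's S3A filesystem does not consult that
  // chain; the region must be set explicitly on its Configuration.
  def hadoopS3aConf(region: String): Configuration = {
    val conf = new Configuration()
    conf.set("fs.s3a.endpoint.region", region) // S3A key, Hadoop 3.3+
    conf
  }
}
```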
The transformer client is not able to decode a server response containing multiple schema versions.

```json
{
  "schema": "iglu:com.snowplowanalytics.snowplow.badrows/loader_iglu_error/jsonschema/2-0-0",
  "data": {
    "processor": {
      "artifact": "snowplow-transformer-kinesis",
      "version": "5.4.0"
    },
    "failure": [
      {...
```
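For anyone reproducing this, a small sketch of decoding the bad row above with circe; the case classes are hypothetical stand-ins shaped after the quoted payload, not the loader's real model:

```scala
import io.circe.Decoder
import io.circe.generic.semiauto.deriveDecoder
import io.circe.parser.decode

final case class Processor(artifact: String, version: String)
final case class BadRowData(processor: Processor)
final case class BadRow(schema: String, data: BadRowData)

object BadRowCheck {
  implicit val processorDecoder: Decoder[Processor] = deriveDecoder
  implicit val dataDecoder: Decoder[BadRowData]     = deriveDecoder
  implicit val badRowDecoder: Decoder[BadRow]       = deriveDecoder

  // Truncated to the fields quoted in the report
  val payload: String =
    """{"schema":"iglu:com.snowplowanalytics.snowplow.badrows/loader_iglu_error/jsonschema/2-0-0",
      |"data":{"processor":{"artifact":"snowplow-transformer-kinesis","version":"5.4.0"}}}""".stripMargin

  val parsed: Either[io.circe.Error, BadRow] = decode[BadRow](payload)
}
```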
Currently, for Redshift and Snowflake, the `domain_sessionid` column is loaded as a `char(128)` while most other columns are `varchar(128)`. In Snowflake this doesn't actually matter, as Snowflake does not...
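The distinction does matter in Redshift, where `CHAR` columns only accept single-byte characters and are blank-padded, while `VARCHAR` accepts multibyte UTF-8. As a sketch, a query one might run against Redshift to spot the affected column (schema/table names taken from the standard `atomic.events` layout, so verify against your deployment):

```scala
// Redshift's pg_table_def view reports char(128) as character(128).
val findCharColumns: String =
  """SELECT "column", type
    |FROM pg_table_def
    |WHERE schemaname = 'atomic'
    |  AND tablename  = 'events'
    |  AND type LIKE 'character(%';""".stripMargin
```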
I just upgraded transformer-kinesis and databricks-loader from version 5.3.0 to version 5.7.0. Before and after the upgrade I ran a stress test using Taurus. After both test...
This PR contains automated tests for Snowflake Loader on Azure. It brings the necessary building blocks to add tests for other destinations and cloud types as well. Test class structures are...
We encountered a case where batch transformer 5.4.1 didn't write the shredding-complete.json file, yet didn't log any warning/error/alert.
Logging SQL statements, along with some other metadata, at debug level could help us understand production behavior better. A possible alternative is [p6spy](https://github.com/p6spy/p6spy), a plug-and-play solution. An example...
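A sketch of what the p6spy route could look like, assuming a plain JDBC connection; the host, credentials, and real driver class here are placeholders:

```scala
import java.sql.{Connection, DriverManager}

object P6SpyExample {
  def connect(): Connection = {
    // p6spy proxies the real driver; the only code change is the URL prefix.
    Class.forName("com.p6spy.engine.spy.P6SpyDriver")
    DriverManager.getConnection(
      "jdbc:p6spy:redshift://example-host:5439/snowplow", // was jdbc:redshift://...
      "loader_user",
      "secret"
    )
  }
}

// src/main/resources/spy.properties (read by p6spy at startup):
//   driverlist=com.amazon.redshift.jdbc42.Driver
//   appender=com.p6spy.engine.spy.appender.Slf4JLogger
```

With the SLF4J appender configured, every statement p6spy intercepts is emitted through the application's existing logging setup, so no extra log plumbing is needed.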
If we have a schema with a property that is an enum, e.g.:

```
"name": {
  "type": "string",
  "enum": ["abc", "def", "ghi", "jkl"],
  "maxLength": 256
}
```

currently...
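One way to read the (truncated) report: the enum values themselves bound the longest possible value, so the column width could be derived from them rather than from `maxLength`. A hypothetical sketch, not the transformer's actual logic:

```scala
// Hypothetical helper: derive a column width from the enum values,
// falling back to maxLength when no enum is present.
def widthFromEnum(enumValues: List[String], maxLength: Int): Int =
  enumValues.map(_.length).maxOption.getOrElse(maxLength)

// For the schema above this would suggest VARCHAR(3) instead of VARCHAR(256)
val width = widthFromEnum(List("abc", "def", "ghi", "jkl"), 256) // 3
```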
This issue is about schema evolutions that add new columns. A problem arises when the data is transformed using the older schema but the load is attempted using...
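A toy illustration of the mismatch, with hypothetical column names: the batch on disk was written against the older schema, while the target table has already been migrated to the newer one:

```scala
// Hypothetical columns; not real loader code.
val columnsInBatch = List("event_id", "user_id")              // transformed with 1-0-0
val columnsInTable = List("event_id", "user_id", "added_col") // table already on 1-0-1

// Loading the two-column batch while naming all three table columns
// fails; one fix is to restrict the column list to what the batch has:
val loadableColumns = columnsInTable.filter(columnsInBatch.toSet)
```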