
Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.

30 dagger issues, sorted by recently updated

Context: To avoid the complexities of handling multiple sinks, we would like to integrate Dagger & [Firehose](https://github.com/odpf/firehose) in a way where all the sink logic is abstracted out in...

enhancement
current_iteration

As part of this issue, we want to add support for handling multiple streams for the Parquet Data Source. That is, users should be able to specify multiple GCS URLs. Dagger should...

Extra fields present in the Parquet data but not in the protobuf schema will be ignored. However, it might be possible that: - there are some fields in the protobuf schema...

As part of this issue, we want to add support for configuring a split assigner which can assign splits to source readers in an `almost-deterministic` order as decided by the...

**Acceptance Criteria**
- Add a basic BigQuery Sink using the sink-connector APIs.
- Configure a Proto/JSON Serialiser to convert Row into OdpfMessage.
- Call sink APIs to push messages to BQ.

**Out...

current_iteration

**Context:** As a first task, we want to create a repo which will hold the sink-connector library. This library will be used by both Firehose and Dagger. It might be...

current_iteration

Currently, there are no null checks for required params coming from the run job API payload. Add support to check and handle them with proper error messages.
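A minimal sketch of what such validation could look like. The parameter names (`SINK_TYPE`, `FLINK_JOB_ID`) and the method shape are hypothetical, not Dagger's actual API; the point is collecting all missing required params into descriptive error messages instead of failing later with a NullPointerException:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical validator: checks a run-job API payload for required params
// and returns a human-readable error message per missing/blank entry.
public class RequiredParamValidator {

    public static List<String> validate(Map<String, String> payload,
                                        List<String> requiredKeys) {
        List<String> errors = new ArrayList<>();
        for (String key : requiredKeys) {
            String value = payload.get(key);
            if (value == null || value.trim().isEmpty()) {
                errors.add("Required parameter '" + key + "' is missing or empty");
            }
        }
        return errors;
    }

    public static void main(String[] args) {
        // Payload is missing FLINK_JOB_ID, so one error is reported.
        Map<String, String> payload = Map.of("SINK_TYPE", "log");
        List<String> errors =
                validate(payload, List.of("SINK_TYPE", "FLINK_JOB_ID"));
        errors.forEach(System.out::println);
    }
}
```

Returning the full list of errors (rather than throwing on the first one) lets the API respond with every problem in the payload at once.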

enhancement
good first issue

Currently, the HashTransformer uses its own implementation for protobuf SerDe.
* Move SerDe logic from dagger-core to dagger-common.
* Use the common SerDe for both core and HashTransformer.

enhancement