pramen issues

A Hive table fails to load if a custom schema is used.

## Describe the bug

Originally, this happened when decimal correction was used with Hive and there were columns of type decimal(38,18). Pramen tries to 'correct' the schema by applying a...

yruslan

bug

Pramen-Scala

DS

Improve unit test coverage of ResultSetToRowIterator.scala

## Feature

Improve unit test coverage of ResultSetToRowIterator.scala.

yruslan

enhancement

Pramen-Scala

Refactoring

Add support for incremental ingestion

## Background

Currently, incremental updates are made by overwriting the latest info date partitions multiple times a day. This can be inefficient, especially for big tables with many events. If...

yruslan

enhancement

Pramen-Scala

DE

Create a CDC transformer

## Background

The CDC transformer can take:

- a table ingested as an initial snapshot, and then changes only
- the primary key
- the pre-combine key (timestamp)

and transform...

yruslan

enhancement

Pramen-Scala

DE

Add support for S3 versions cleanup via a special REST API call

## Background

This is a requirement for Enceladus and Spark versions that do not support committers without copying of data.

## Feature

Add support for S3 versions cleanup via a...

yruslan

enhancement

Pramen-Scala

Priority

DE

Add an ability to format info date and insert date expression into JDBC SQL queries

## Background

Users want to customize SQL queries with dates derived from the information date as part of an expression, including in table names. For example:

```
SELECT * FROM my_table_202402
```

where...

yruslan

enhancement

Pramen-Scala

DS

Add a mode for initial ingestion of event data

## Background

When an event table is ingested initially, the history can be quite long. Executing each event date one by one can have a big overhead and execute many...

yruslan

enhancement

Pramen-Scala

DE

Use more effective record count when JDBC source is used with an SQL query

## Background

This idea was reported by @filiphornak.

Currently, the record count is calculated this way if an SQL expression (rather than a table name) is used as an input to the...

yruslan

enhancement

Calculate and log stability metric for each operation

## Background

The stability metric is computed based on the number of input and output dependencies:

![Screenshot 2024-01-11 at 10 32 42](https://github.com/AbsaOSS/pramen/assets/4082463/5fcb719b-fb60-4d73-b0ce-73382f86d6db)

- `I` - number of input dependencies - the...

yruslan

enhancement

Pramen-Scala

Add 'pramen-test' artifact that contains fixtures for testing Pramen components

## Background

Currently, fixtures used for testing Pramen sources, sinks, and transformations are in the test code only, and not published as part of an artifact. It would be helpful...

yruslan

enhancement

Pramen-Scala
