hyperdrive icon indicating copy to clipboard operation
hyperdrive copied to clipboard

DeduplicateKafkaSinkTransformer should work with fields from payload

Open kevinwallimann opened this issue 3 years ago • 0 comments

When changing https://github.com/AbsaOSS/hyperdrive/blob/develop/driver/src/test/scala/za/co/absa/hyperdrive/driver/drivers/KafkaToKafkaDeduplicationAfterRetryDockerTest.scala#L73-L74

to

      "transformer.[kafka.deduplicator].source.id.columns" -> "value.record_id",
      "transformer.[kafka.deduplicator].destination.id.columns" -> "value.record_id"

the test fails with the following exception:

org.apache.spark.SparkException: Malformed records are detected in record parsing.
Caused by: org.apache.avro.AvroTypeException: Found null, expecting string

Even if this behavior is documented, it may never have been working.

kevinwallimann avatar Sep 27 '21 14:09 kevinwallimann