finops-toolkit icon indicating copy to clipboard operation
finops-toolkit copied to clipboard

[Hubs] Don't ingest empty files

Open flanakin opened this issue 5 months ago â€ĸ 3 comments

🐛 Problem

The ingestion_ExecuteETL pipeline triggers ingestion_ETL_dataExplorer when there are no rows.

đŸ‘Ŗ Repro steps

  1. Export data for a period which has no data (e.g., transactions on the first of the month)
  2. The ingestion_ExecuteETL pipeline will trigger the ingestion_ETL_dataExplorer pipeline for a single file with no rows, which will fail to ingest because there are no rows

🤔 Expected

Nothing should be ingested when there are no rows. There should be no errors.

â„šī¸ Additional context

Update the ingestion_ExecuteETL pipeline to check the export manifest.json file for the "dataRowCount" integer JSON property. If the value is 0, do not try to ingest data. If we fail to read the value, continue to ingest the data assuming there is at least one row.

đŸ™‹â€â™€ī¸ Ask for the community

We could use your help:

  1. Please vote this issue up (👍) to prioritize it.
  2. Leave comments to help us solidify the vision.

flanakin avatar Jul 01 '25 21:07 flanakin

This has also totally confused me the last days and I can confirm this issue and would vote for it to be fixed.

bmargula avatar Sep 12 '25 11:09 bmargula

This issue specifies the solution design for the bug reported in #1535.

MSBrett avatar Oct 11 '25 18:10 MSBrett

Just did a multi tenant installation (8 tenants). most tenants are new, so no reservations or recommendation data. The daily ingestion process takes more than 10 hours due to the export files with 0 lines being processed. Definitely would vote for adding the check for "dataRowCount"=0.

japyjaap avatar Nov 11 '25 09:11 japyjaap