gridemissions icon indicating copy to clipboard operation
gridemissions copied to clipboard

Bulk Processed Files - dealing with duplicate periods across years

Open klo9klo9kloi opened this issue 10 months ago • 2 comments

When looking at e.g. EIA930_2019_Jan_Jun_co2.csv, the period starts a few hours after 2019-01-01 00:00:00 rather than at the mark, which I'm guessing has something to do with UTC shifting.

Keeping that example file, if I then look at EIA930_2018_Jul_Dec_co2.csv, the period also overflows into year 2019 for a few hours, such that if I concatenate these two files then there are some duplicate periods.

If I am aggregating emissions by year, what is the proper way to deal with these duplicate period rows? Aggregate? Take the one from the latest year?

klo9klo9kloi avatar Apr 25 '24 19:04 klo9klo9kloi