Greg Miller
Greg Miller
After running the EIA-923 allocation, we currently check whether the total plant-level generation and fuel matches the input data. However, we should also check for each fuel consumed at the...
Currently, all of our consumed emissions factor data is labeled as "carbon accounting data". While carbon accounting is one important use of this data, it has broader uses as well,...
When running the data pipeline for 2021, there are several warnings appearing for various data outputs regarding incomplete timeseries data: Data quality metric export: ``` 2023-06-09 20:20:56,968 [INFO] oge.output_data:158 Exporting...
When cleaning the EIA-930 data, the following warning is raised: ``` 2023-06-09 18:15:39,608 [INFO] oge.eia930:155 Running physics-based data cleaning 2023-06-09 18:15:40,865 [WARNING] load:203 Inconsistent columns: set(NG_cols) != set(ID_cols2) ``` After...
When running the 2021 data pipeline with our new data validation checks, the following warning is raised: ``` 2023-06-09 17:39:24,002 [WARNING] oge.validation:78 Allocated EIA-923 doesn't match input data for plants:...
We should implement some sort of outlier detection and screening for the hourly values reported in CEMS. This outlier detection could use a combination of statistical methods and physics-based methods...
Currently, many of our results files are denormalized and formatted in a way that assumes someone will be using these files programmatically and is able to easily merge the files...
Each plant can report data to EIA on three different frequencies: respondent_frequency | Description -- | -- A | The respondent only provides an annual total(s) for this record via...
Per the need identified in https://github.com/singularity-energy/open-grid-emissions/issues/167 and discussions at the OpEnMod Workshop about challenges with data availability in Canada, it may be worth adding emissions factor data from Statistics Canada...
We have heard user requests for total regional consumed emissions, which is a data point that we do not currently calculate. This calculation is fairly straightforward using the consumption based...