powerplantmatching
Introduce/Include MaStR data
Hello dear friends, I just finished a new data download of the MaStR database. The data quality is improving slowly. Now I am facing some problems in the wind dataset: there are around 10000 duplicate entries that I would like to find. Has anyone already tried MaStR with PPM?
Originally posted by @Ludee in https://github.com/FRESNA/powerplantmatching/issues/11#issuecomment-566548943
New dataset available: https://zenodo.org/record/6807426#.Ywx33VpByV5
Latest version
https://zenodo.org/records/8225106
Citation
Eventually: https://joss.theoj.org/papers/dc0d33e7dc74f7233e15a7b6fe0c7a3e
Which CSV files?
- biomass
- combustion
- nuclear
- hydro
- (wind)
- (solar)
It has to be checked whether the wind and solar datasets are too large to be handled efficiently.
Instructions by @FabianHofmann:
- start with an independent script:
  - create a routine to retrieve them all (leaving out the renewables for the start)
  - concat them
  - potentially convert units
  - rename and select columns to align with the ppm format
  - clean up everything
- modularization and ppm integration:
  - create a function mastr() which does all the above steps in one
  - add it to powerplantmatching/data.py
  - add a config entry to package_data/config.py to make ppm aware of the new data input, use a high reliability score
  - try it out
  - add a test for pure retrieval
- integrate into the merging routine:
  - add mastr to matching_sources
  - try out the matching with ppm.powerplants(update=True)
  - run https://github.com/PyPSA/powerplantmatching/blob/master/analysis/compare-with-entsoe-stats.py
  - possibly adjust the filter query and possibly add it to fully_included_sources in config.yaml
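The first step (an independent script that concatenates the per-technology tables, converts units, and renames and selects columns) could look roughly like the sketch below. It is only an illustration under several assumptions: the MaStR column names in COLUMN_MAP, the assumption that capacities are given in kW, and the helper name combine_mastr are hypothetical and need to be checked against the actual CSV files. The inline CSV strings stand in for the downloaded files.

```python
import io

import pandas as pd

# Assumed MaStR column names mapped to ppm-style column names.
# These source names are a guess and must be verified against the real CSVs.
COLUMN_MAP = {
    "EinheitMastrNummer": "projectID",
    "NameStromerzeugungseinheit": "Name",
    "Nettonennleistung": "Capacity",
    "Breitengrad": "lat",
    "Laengengrad": "lon",
}


def combine_mastr(frames):
    """Combine per-technology tables into one ppm-like table.

    frames: dict mapping a ppm fuel type label to the raw DataFrame.
    """
    parts = []
    for fueltype, df in frames.items():
        # Rename and select only the columns we know how to map.
        part = df.rename(columns=COLUMN_MAP)[list(COLUMN_MAP.values())].copy()
        part["Fueltype"] = fueltype
        parts.append(part)
    out = pd.concat(parts, ignore_index=True)
    out["Capacity"] = out["Capacity"] / 1e3  # assumed kW -> MW
    out["Country"] = "Germany"
    # Clean up: drop incomplete rows and repeated unit IDs.
    out = out.dropna(subset=["Name", "Capacity"])
    out = out.drop_duplicates(subset="projectID")
    return out.reset_index(drop=True)


# Tiny in-memory stand-ins for the downloaded CSV files.
header = ",".join(COLUMN_MAP)
biomass_csv = header + "\nSEE1,Biogas A,500,52.1,10.2\nSEE1,Biogas A,500,52.1,10.2\n"
hydro_csv = header + "\nSEE2,Wasserkraft B,1200,48.0,11.5\n"

plants = combine_mastr(
    {
        "Bioenergy": pd.read_csv(io.StringIO(biomass_csv)),
        "Hydro": pd.read_csv(io.StringIO(hydro_csv)),
    }
)
print(plants)
```

With real data, the io.StringIO objects would be replaced by the paths or URLs of the biomass, combustion, nuclear and hydro files, and the same body could later be wrapped into the mastr() function mentioned in the second step.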