gridstatus
gridstatus copied to clipboard
Clean Up Interconnection Queue Data
Each of the iso's now have a get_interconection_queue method that returns their respective queues. Some work was done to standardize columns names.
Each queue is useful on it's own, but combined there are a still small data difference
Status Column
One first area to look at is the Status field where there are many different but similar values
all_queues = gridstatus.get_interconnection_queues()
all_queues["Status"].value_counts()
Withdrawn 7405
Active 6431
WITHDRAWN 1481
In Service 1028
Completed 751
Done 746
ACTIVE 594
Engineering and Procurement 301
Confirmed 223
COMPLETED 199
Retracted 131
Suspended 74
Under Construction 58
Deactivated 54
Partially in Service - Under Construction 46
Annulled 24
Withdrawn - Pending Cure 2
1
Name: Status, dtype: int64
We should standardize these values. There can be an extra column preserved where the original status remains.
There is a InterconnectionQueueStatus enum in base.py that should contain all possible values
Interconnection Agreement Completed Date vs Project Completed Date
Should confirm that the completion dates are all accurately and consistently mapped