exdata icon indicating copy to clipboard operation
exdata copied to clipboard

Case Study: 2012 Dataset is incomplete (ends on Oct 18, 2012)

Open Pinatata opened this issue 3 years ago • 0 comments

After further exploring the provided data sets for the final case study, it seems that the provided 2012 data set is incomplete for the year and ends on Oct 18, 2012. This has no effect on the code, but the case study text fails to explore or acknowledge the incompleteness of the 2012 data set. This is a glaring omission given the quoted hypothesis:

Our overall hypothesis is that outdoor PM2.5 has decreased on average across the U.S. due to nationwide regulatory requirements arising from the Clean Air Act.

The case study explores changes in PM2.5 levels YoY for each state, but it should be acknowledged that the lack of 2012Q4 data makes for incomplete comparisons between 2012 and 1999.

I've tried to find an equivalent & up-to-date 2012 dataset to make a more complete comparison but the EPA's daily_88101_2012 dataset has many inconsistencies compared to your provided 2012 dataset, namely in the way 'Site Codes' are handled. If there's a different dataset I should be using I'd appreciate being pointed in the right direction.

Otherwise, I think it would be fair to at least acknowledge in the case study text the incompleteness of the 2012 data.

image

image

Pinatata avatar Feb 12 '22 17:02 Pinatata