2015-foia-hub icon indicating copy to clipboard operation
2015-foia-hub copied to clipboard

Rewrite web scrape of contacts in python -- output yaml

Open jackiekazil opened this issue 10 years ago • 6 comments

This ticket is for the rewrite of https://github.com/18F/foia/tree/master/contacts

In the process, please adjust the following ---

  • [x] Fix this bug which was manually updated in this commit -- https://github.com/18F/foia/commit/e918343e1192fec39c5e2f03d3ba791a65f06573#diff-d41d8cd98f00b204e9800998ecf8427e
  • [ ] Add the Chief FOIA Office data to the output. It is located under the Agency icon on the right. (I believe this is on the Agency, not on the office, but I am not certain.)
  • [x] This record gets cut off Farm Credit Administration description: "FCA´s mission is to ensure a dependable source of credit for agriculture and rural America. We do this in two ways:",
  • [ ] Put out put in this folder (https://github.com/18F/foia-core/tree/master/data), unless we want to make a data repo OR put all the data in main repo -- Open for discussion. I am more concerned with consistency.
  • [ ] Put script here: https://github.com/18F/foia-core/tree/master/foia_core/scripts -- This might move / if it is turned into a management command,... so this might be a temp holding place. This is to keep it with the other script and is open to discussion.
  • [x] Is this scrape error or data error? If data error, can you put in a fix for it output correctly in the yaml? https://github.com/18F/foia/commit/ee3fcdbcbdf0293e5b9be62e562836eda762b1e0
  • [x] This too: https://github.com/18F/foia/commit/a2f8f596e6a2a87faf0477de592ec0a381593914
  • [x] This too: https://github.com/18F/foia/commit/6b010bd9086e4ee1e88daf445219da380aba18f4

jackiekazil avatar Aug 11 '14 15:08 jackiekazil