police-data-trust icon indicating copy to clipboard operation
police-data-trust copied to clipboard

[FEATURE] Data Insert Logic - Officer

Open DMalone87 opened this issue 1 year ago • 1 comments

Extend the Flask API to read from a .jsonl file of scraped Officers data and insert it into the Postgres DB.

Is your feature request related to a problem? Please describe. As part of our data ETL pipeline, we need to transform data that has been scraped and loaded into an S3 bucket and load it into our database. This feature focuses on loading officer data.

Describe the solution you'd like

  • Read data from the attached officers.jsonl file
  • Transform the data so that it conforms to our officer data schema
  • Verify that the officer data is not a duplicate of some data that was previously loaded (and update if changed)
  • Load the officer data into the database

Additional context Data from a scraper is a requirement for this task. You can download that fixture here.

DMalone87 avatar May 15 '24 00:05 DMalone87

50a.jsonl.zip

mikeyavorsky avatar May 22 '24 00:05 mikeyavorsky