data.gov
data.gov copied to clipboard
O+M 2023-12-11
As part of day-to-day operation of Data.gov, there are many Operation and Maintenance (O&M) responsibilities. Instead of having the entire team watching notifications and risking some notifications slipping through the cracks, we have created an O&M Triage role. One person on the team is assigned the Triage role which rotates each sprint. This is not meant to be a 24/7 responsibility, only East Coast business hours. If you are unavailable, please note when you will be unavailable in Slack and ask for someone to take on the role for that time.
Check the O&M Rotation Schedule for future planning.
Acceptance criteria
You are responsible for all O&M responsibilities this week. We've highlighted a few so they're not forgotten. You can copy each checklist into your daily report.
Daily Checklist
Note: Catalog Auto Tasks You will need to update the chart values manually. Click the Action link in each issue and grab the values from
monitor task output
andcheck runtime
.
- [ ] DB-Solr Sync
- [ ] Check Harvesting Emails
- [ ] New Relic Alerts Triaged
- [ ] Triage DMARC Report from Google
Weekly Checklist
- [ ] Audit Log (more info on AU-3 and AU-6 Log auditing)
- [ ] Tracking Update
- NOTE: This job will consistently timeout, but it is processing results ((more details)[https://github.com/GSA/data.gov/issues/4345])
- [ ] Check Catalog Solr
- [ ] Catalog Dupe Check
Monthly Checklist
- [ ] Invicti Scan
ad-hoc checklist
- [ ] audit/review applications on cloud foundry and determine what can be stopped and/or deleted.
Reference
- Watch for user email requests
- Watch in #datagov-alerts and Vulnerable dependency notifications (daily email reports) for critical alerts.
- Monitor and improve Data.gov O&M Dashboard
- Update and revise Data.gov O&M Tasks
DOT Socrata Data.json returns 500 for recent job. this hasn't occurred before. will report back if issue resolves itself.
montgomery county md json returns 500 for recent job. doesn't appear to be a common thing.