data.gov icon indicating copy to clipboard operation
data.gov copied to clipboard

O+M 2024-04-22

Open Jin-Sun-tts opened this issue 10 months ago • 4 comments

As part of day-to-day operation of Data.gov, there are many Operation and Maintenance (O&M) responsibilities. Instead of having the entire team watching notifications and risking some notifications slipping through the cracks, we have created an O&M Triage role. One person on the team is assigned the Triage role which rotates each sprint. This is not meant to be a 24/7 responsibility, only East Coast business hours. If you are unavailable, please note when you will be unavailable in Slack and ask for someone to take on the role for that time.

Check the O&M Rotation Schedule for future planning.

Acceptance criteria

You are responsible for all O&M responsibilities this week. We've highlighted a few so they're not forgotten. You can copy each checklist into your daily report.

Daily Checklist

Note: Catalog Auto Tasks You will need to update the chart values manually. Click the Action link in each issue and grab the values from monitor task output and check runtime.

Weekly Checklist

Monthly Checklist

ad-hoc checklist

  • [ ] audit/review applications on cloud foundry and determine what can be stopped and/or deleted.

Reference

Jin-Sun-tts avatar Apr 22 '24 16:04 Jin-Sun-tts

DOE's /harvest/arm-data-json are down for two days

Screenshot 2024-04-22 at 12 59 20 PM

Jin-Sun-tts avatar Apr 22 '24 17:04 Jin-Sun-tts

Tuesday 04/23

https://github.com/GSA/data.gov/

Image

rerun the failed commit job, it succeeded.

Check Catalog Auto Tasks

Check Harvesting Emails

Update: Could not access the DOE site for this harvest source : https://www.archive.arm.gov/data/, but the job finished without error today. Looks like their server goes up and down.

  • [x] Catalog:

  • [x] DB-Solr Sync:
    0 packages need to be removed from Solr 0 packages need to be updated/added to Solr 429 packages without harvest_object need to be mannually deleted Finished 520s

Checked catalog, inventory production, works fine.

Also checked Solr leader and followers, all work as normal.

Jin-Sun-tts avatar Apr 23 '24 14:04 Jin-Sun-tts

Tuesday 04/23

https://github.com/GSA/data.gov/

Screenshot 2024-04-25 at 9 45 55 AM

Check Catalog Auto Tasks

Check Harvesting Emails

Harvest Source: NASA Data.json
Organization: nasa-gov
Created: 2024-04-24 16:52:45.861624
Finished: 2024-04-24 16:54:02.582310
    - Error loading json content: not enough values to unpack (expected 2, got 0).
    - ProxyError getting json source: HTTPSConnectionPool(host='[data.nasa.gov](http://data.nasa.gov/)', port=443): Max retries exceeded with url: /data.json (Caused by ProxyError('Unable to connect to proxy', RemoteDisconnected('Remote end closed connection without response'))).

The error occurred on April 23, 2024, but it was success on April 22, 2024. This issue has occurred intermittently in the past, indicating possible issue in resource availability

Harvest Source: DOI EDI
Organization: doi-gov
Created: 2024-04-24 16:52:45.119525
Finished: 2024-04-24 16:53:46.331441
    - Error loading json content: not enough values to unpack (expected 2, got 0).
    - HTTPError getting json source: 504 Server Error: Gateway Time-out for url: https://datainventory.doi.gov/data.json.
   

The job run on April 24, 2024, at 5:23 PM was successful. This connection error had not occurred previously.

  • [x] Catalog:

  • [x] DB-Solr Sync:
    0 packages need to be removed from Solr 0 packages need to be updated/added to Solr 429 packages without harvest_object need to be mannually deleted Finished 555s

Checked catalog, inventory production, works fine.

Also checked Solr leader and followers, all work as normal.

Jin-Sun-tts avatar Apr 25 '24 13:04 Jin-Sun-tts

As one user pointed out, harvester /harvest/energy-json appears to be using a wrong URL. The current https://www.energy.gov/sites/default/files/2023-04/pdl040323.json is frozen at 2023-04 time frame. A dynamic URL https://www.energy.gov/data.json seems to be the correct URL, it redirects to the current year current month. @hkdctol

FuhuXia avatar Apr 26 '24 14:04 FuhuXia