odis-arch
odis-arch copied to clipboard
connect UNEP (WESR) catalogue as ODIS node
- current status:
- WESR team has enabled CKAN-DCAT extension, on dev server, to enable JSON-LD on each record page
- production catalogue: https://wesr.unep.org/
- WESR team to generate a
sitemap.xml
, to point to each catalogue page (that embeds JSON-LD)
- WESR team has enabled CKAN-DCAT extension, on dev server, to enable JSON-LD on each record page
cc @sobiero
@jmckenna
During the call with WESR got the following URLs for us to inspect.
- https://stg-data.unep.org/app/
- https://stg-data.unep.org/dc/catalog.rdf
- https://stg-data.unep.org/dc/catalog.xml
- https://stg-data.unep.org/catalog.xml?page=1
Jeff, Simon Mogaka asked if you could send him the information on the JSON-LD/schema.org plugin for CKAN.
@fils @jmckenna Please note the last link should be https://stg-data.unep.org/dc/catalog.xml?page=1 as the catalogue has been installed on the path /dc/
JSON-LD endpoint: https://stg-data.unep.org/dc/catalog.jsonld?page=1
And the documentation for the DCAT endpoint is available at: https://extensions.ckan.org/extension/dcat/#catalog-endpoint
updates from last meeting:
- catalogue is live: https://datacatalog.unep.org/app/
- sample record: https://datacatalog.unep.org/app/dataset/unep-wcmc-rsrc-report-nature-risk-profile--a-methodology-for-profiling-nature-related-dependencies-a
Action items:
- WESR implements the CKAN-DCAT extension on their CKAN pages, so each page contains JSON-LD/schema.org content for ODIS to harvest
- must set the default profile to be
schemaorg
in the extension
- must set the default profile to be
- WESR creates a
sitemap.xml
to point to the JSON-LD pages- for more info on the sitemap, see: https://developers.google.com/search/docs/crawling-indexing/sitemaps/build-sitemap
- WESR creates an entry in ODISCat for its catalogue
- login with your OceanExpert ID and click
Add
- important fields are
Startpoint URL for ODIS-Arch
(this is your sitemap url) andType of the ODIS-Arch URL
(choosesitemap
)
- login with your OceanExpert ID and click
- ODIS creates an export for WESR to harvest, developed in its
archinterfaces
directory- UPDATE: directory shell is created at archinterfaces/ODIS-WESR, where steps will be shared
- ODIS and WESR test exchanges to make sure they can cross-index each other's resources
- ODIS figures out how to extract ocean-relevant content from WESR
Updates from meeting on 2023-11-24:
- Herbert enabled entry of UNEP into the ODIS Catalogue using Ocean Expert ID and login details that we provided him. https://catalogue.odis.org/view/3288
- WESR implemented the CKAN-DCAT extension on their CKAN pages, so that each page contains JSON-LD/schema.org content for ODIS to harvest.
- WESR had also created a sitemap.xml to point to the JSON-LD pages https://datacatalog.unep.org/en/oceans_sitemap.xml
- From our side (GEMS Ocean), we identified ocean-relevant datasets from WCMC, GPML, and SDG groups that the ODIS could consume. Please find link to the Ocean Relevant Data Sets
from Hyrence email:
Please find attached a document with some fields sourced from Ocean Info Hub for what we would like to pull. As per our previous discussion, the metadata we would like to obtain for the topics through your JSON API include; Title, description, source url, last-updated, keywords/category/variables attached, spatial, provider/organisation, provider email, data type e.g. html, doc, excel, etc,. More specifics could be discussed in the next meeting.
@fils @jmckenna
The next steps:
- We should test the WESR harvest through the sitemap they've provided
- We need to set up some queries that WESR can modify, to extract JSON-LD records with the keywords (including those in DefinedTerm stanzas) and variableMeasured values specified in the XLSX file here
14 Nov 2024 Meeting with Joanna, Ken, Herbert, Jeff, Lucy
Tasks/notes from the 14 Nov meeting:
- :white_check_mark: Jeff to switch the ODISCat entry (for the "UNEP Data Catalogue") to Joana's OceanExpert account
- Herbert: are switching from CKAN to the DataHub Project
The minimum metadata fields we would like to obtain for the keywords (with pagination/limit of datasets per request and total no. of datasets found) through your JSON endpoint include; title, description, source url, last-updated, keywords/category, spatial/location, provider/organisation, provider email, data-type e.g. html, doc, excel, etc.
Key Words | |
---|---|
1 | Coral reefs |
2 | Coastal processes |
3 | Coral bleaching |
4 | Community Monitoring |
5 | Oceans |
6 | Marine |
7 | Global Warming |
8 | Sea Grass |
9 | Mangoves |
10 | Data |
11 | Observation |
12 | Biodiversity |
13 | Coastal fisheries |
14 | Climate change effects |
15 | Marine Inverterbrates |
16 | Marine Vertebrates |
17 | Corals |
18 | Land Use |
19 | Coastal Zone Planning |
20 | Land Use Planning |
21 | National Planning |
22 | Policy Planning |
23 | Blue Carbon |
24 | Coastal protection |
25 | Recreation |
26 | Rocky Shores |
27 | Occurrence |
28 | Coral Cover |
29 | Turtles |
30 | Ecology |
31 | Environmental Management |
32 | Habitats and Biotopes |
33 | Geo Scientific Information |
34 | Marine Habitats |
35 | Marine Data |
36 | Sea Bed |
37 | Dredging |
38 | Turbidity |
39 | Industrial Discharges |
40 | Earth Science |
41 | Species Distribution |
42 | Marine Environmental Baselines |
43 | Biota |
44 | Ocean acidification |
45 | Marine National Facility |
46 | Marine Survey |
47 | Ocean temperatures |
48 | Gloal Positioning System |
49 | Marine Spatial Plan |
50 | Area Management |
51 | Regional Seas |
52 | Fishery Law and legislation |
53 | Benthic Habitat |
54 | Ocean Winds |
55 | Intertidal Zones |
56 | Endagered Species |