odis-arch icon indicating copy to clipboard operation
odis-arch copied to clipboard

connect UNEP (WESR) catalogue as ODIS node

Open jmckenna opened this issue 1 year ago • 6 comments

  • current status:
    • WESR team has enabled CKAN-DCAT extension, on dev server, to enable JSON-LD on each record page
      • production catalogue: https://wesr.unep.org/
    • WESR team to generate a sitemap.xml, to point to each catalogue page (that embeds JSON-LD)

cc @sobiero

jmckenna avatar Jun 13 '23 15:06 jmckenna

@jmckenna

During the call with WESR got the following URLs for us to inspect.

  • https://stg-data.unep.org/app/
  • https://stg-data.unep.org/dc/catalog.rdf
  • https://stg-data.unep.org/dc/catalog.xml
  • https://stg-data.unep.org/catalog.xml?page=1

Jeff, Simon Mogaka asked if you could send him the information on the JSON-LD/schema.org plugin for CKAN.

fils avatar Jul 25 '23 14:07 fils

@fils @jmckenna Please note the last link should be https://stg-data.unep.org/dc/catalog.xml?page=1 as the catalogue has been installed on the path /dc/

JSON-LD endpoint: https://stg-data.unep.org/dc/catalog.jsonld?page=1

And the documentation for the DCAT endpoint is available at: https://extensions.ckan.org/extension/dcat/#catalog-endpoint

sobiero avatar Jul 25 '23 15:07 sobiero

updates from last meeting:

  • catalogue is live: https://datacatalog.unep.org/app/
    • sample record: https://datacatalog.unep.org/app/dataset/unep-wcmc-rsrc-report-nature-risk-profile--a-methodology-for-profiling-nature-related-dependencies-a

Action items:

  1. WESR implements the CKAN-DCAT extension on their CKAN pages, so each page contains JSON-LD/schema.org content for ODIS to harvest
    • must set the default profile to be schemaorg in the extension
  2. WESR creates a sitemap.xml to point to the JSON-LD pages
    • for more info on the sitemap, see: https://developers.google.com/search/docs/crawling-indexing/sitemaps/build-sitemap
  3. WESR creates an entry in ODISCat for its catalogue
    • login with your OceanExpert ID and click Add
    • important fields are Startpoint URL for ODIS-Arch (this is your sitemap url) and Type of the ODIS-Arch URL (choose sitemap)
  4. ODIS creates an export for WESR to harvest, developed in its archinterfaces directory
  5. ODIS and WESR test exchanges to make sure they can cross-index each other's resources
  6. ODIS figures out how to extract ocean-relevant content from WESR

jmckenna avatar Nov 06 '23 18:11 jmckenna

Updates from meeting on 2023-11-24:

  • Herbert enabled entry of UNEP into the ODIS Catalogue using Ocean Expert ID and login details that we provided him. https://catalogue.odis.org/view/3288
  • WESR implemented the CKAN-DCAT extension on their CKAN pages, so that each page contains JSON-LD/schema.org content for ODIS to harvest.
  • WESR had also created a sitemap.xml to point to the JSON-LD pages https://datacatalog.unep.org/en/oceans_sitemap.xml
  • From our side (GEMS Ocean), we identified ocean-relevant datasets from WCMC, GPML, and SDG groups that the ODIS could consume. Please find link to the Ocean Relevant Data Sets

jmckenna avatar Jan 22 '24 16:01 jmckenna

from Hyrence email: Please find attached a document with some fields sourced from Ocean Info Hub for what we would like to pull. As per our previous discussion, the metadata we would like to obtain for the topics through your JSON API include; Title, description, source url, last-updated, keywords/category/variables attached, spatial, provider/organisation, provider email, data type e.g. html, doc, excel, etc,. More specifics could be discussed in the next meeting.

Data Sets R.xlsx

jmckenna avatar Apr 04 '24 13:04 jmckenna

@fils @jmckenna

The next steps:

  • We should test the WESR harvest through the sitemap they've provided
  • We need to set up some queries that WESR can modify, to extract JSON-LD records with the keywords (including those in DefinedTerm stanzas) and variableMeasured values specified in the XLSX file here

pbuttigieg avatar Apr 04 '24 13:04 pbuttigieg

14 Nov 2024 Meeting with Joanna, Ken, Herbert, Jeff, Lucy

Lucy-Scott avatar Nov 14 '24 13:11 Lucy-Scott

Tasks/notes from the 14 Nov meeting:

  • :white_check_mark: Jeff to switch the ODISCat entry (for the "UNEP Data Catalogue") to Joana's OceanExpert account
  • Herbert: are switching from CKAN to the DataHub Project

jmckenna avatar Dec 03 '24 14:12 jmckenna

The minimum metadata fields we would like to obtain for the keywords (with pagination/limit of datasets per request and total no. of datasets found) through your JSON endpoint include; title, description, source url, last-updated, keywords/category, spatial/location, provider/organisation, provider email, data-type e.g. html, doc, excel, etc.

  Key Words
1 Coral reefs
2 Coastal processes
3 Coral bleaching
4 Community Monitoring
5 Oceans
6 Marine
7 Global Warming
8 Sea Grass
9 Mangoves
10 Data
11 Observation
12 Biodiversity
13 Coastal fisheries
14 Climate change effects
15 Marine Inverterbrates
16 Marine Vertebrates
17 Corals
18 Land Use
19 Coastal Zone Planning
20 Land Use Planning
21 National Planning
22 Policy Planning
23 Blue Carbon
24 Coastal protection
25 Recreation
26 Rocky Shores
27 Occurrence
28 Coral Cover
29 Turtles
30 Ecology
31 Environmental Management
32 Habitats and Biotopes
33 Geo Scientific Information
34 Marine Habitats
35 Marine Data
36 Sea Bed
37 Dredging
38 Turbidity
39 Industrial Discharges
40 Earth Science
41 Species Distribution
42 Marine Environmental Baselines
43 Biota
44 Ocean acidification
45 Marine National Facility
46 Marine Survey
47 Ocean temperatures
48 Gloal Positioning System
49 Marine Spatial Plan
50 Area Management
51 Regional Seas
52 Fishery Law and legislation
53 Benthic Habitat
54 Ocean Winds
55 Intertidal Zones
56 Endagered Species

kimanih avatar Jan 16 '25 13:01 kimanih