croissant icon indicating copy to clipboard operation
croissant copied to clipboard

Automatically publish HTML pages with embedded Croissant mark-up from Github datasets

Open benjelloun opened this issue 2 years ago • 0 comments

Croissant datasets in our Github repository are represented as standalone JSON-LD files. In order to make them crawlable, they should be made accessible as web pages with the JSON-LD embedded inside a script tag.

Instead of creating those pages manually, we should automate the process by:

  1. Creating a Python script that converts the JSON-LD standalone file into the desired HTML + JS representation
  2. Creating a Github action that calls the script and commits a new or updated HTML file whenever a dataset description is changed in the repository.

benjelloun avatar Sep 26 '23 13:09 benjelloun