croissant
croissant copied to clipboard
consider reusing CSVW and DQV
It's great that you reuse Schema.org. But please also consider reusing these:
- CSVW is for describing the semantics of CSV, or even exposing CSV tables as RDF. I see a big overlap, eg
cr:Field - DQV allows capturing objective and subjective quality observations about datasets.
- Perhaps it is applicable to the RAI (Responsible AI) part, where currently all props are simple text.
- See these papers: Introducing the Data Quality Vocabulary (DQV) A comprehensive quality model for Linked Data Automated approach for quality assessment of RDF resources
- See these usages ("implementations")
Hi Vladimir,
We considered CSVW, but it wasn't appropriate to describe the structure of data in Croissant, as it focuses on CSV tables. We needed a construct that could also describe unstructured data like text, images, etc., as well as nested data, like JSON, and allows joining data across these modalities.
Thanks for the pointers to DQV, definitely worth considering for RAI, as well as potential future extensions that are related to quality (e.g., in the health or geospatial domains).