data_kitten
data_kitten copied to clipboard
Get dataset metadata in a consistent format - no matter what you throw at it
Bumps [rspec](https://github.com/rspec/rspec-metagem) from 3.11.0 to 3.12.0. Commits 58d7eac v3.12.0 f4393df Version 3.11.0 See full diff in compare view [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=rspec&package-manager=bundler&previous-version=3.11.0&new-version=3.12.0)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as...
A web page might have a ``link`` to an alternate machine-readable view of the page. Or the HTTP response might have a ``Link`` header that does the same. Would be...
JSON-LD metadata can be embedded in web pages: https://developers.google.com/schemas/formats/json-ld E.g. to describe datasets for indexing by Google: https://developers.google.com/search/docs/data-types/datasets Or scientific datasets more generally: http://dx.doi.org/10.1101/097196 Would be useful if data kitten...
When pulling from CKAN instances, the description is always treated as plain text, when it is actually markdown - https://github.com/theodi/data_kitten/blob/master/lib/data_kitten/publishing_formats/ckan.rb#L53 This shows up as an issue when generating certificates using...
Assumes CKAN site is [located at root `/`](https://github.com/theodi/data_kitten/blob/62568a05c1/lib%2Fdata_kitten%2Fpublishing_formats%2Fckan.rb#L257), which it sometimes isn't.
If a CKAN API version 3 URL is passed to DataKitten::Dataset, it makes the request, extracts the `id` and makes [a second request](https://github.com/theodi/data_kitten/blob/8e67213d98/lib%2Fdata_kitten%2Fpublishing_formats%2Fckan.rb#L30) to the version 2 endpoint, which it...
Using `accrualPeriodicity` instead of `update_frequency` as a method name in Dataset seems like it would be more consistent with [DCAT](http://www.w3.org/TR/vocab-dcat/#Property:dataset_frequency).
A datapackage resource might have a "path", or a "url" or both. Sometimes we're using on and sometimes the other, we ought to normalise to a URL, taking into account...
This stores a media type, so related to format and extension stuff we have already.