node-schema-org icon indicating copy to clipboard operation
node-schema-org copied to clipboard

FYI - schema.rdfs.org publishes schemas in JSON format

Open ghost opened this issue 12 years ago • 17 comments
trafficstars

http://schema.rdfs.org/

http://schema.rdfs.org/all.json

ghost avatar Aug 11 '13 14:08 ghost

The page is sadly not maintained anymore. If needed we scrape Schema.org every day and offer a download possibility at: http://schema.link.fish

http://schema.link.fish/downloads/all.json http://schema.link.fish/downloads/all.json.gz (same but gzip)

janober avatar Nov 19 '15 11:11 janober

@janober neat! Do you use this library to do the scraping?

indexzero avatar Nov 19 '15 12:11 indexzero

No have to confess I did use the one of http://schema.rdfs.org/: https://github.com/mhausenblas/schema-org-rdf/tree/master/scrapers

They were the first I found some time ago. This library I just found today.

Does this library still work or did it also break with the changes to the schema.org website?

janober avatar Nov 19 '15 13:11 janober

Might have broken, but I know that @rektide is currently using it.

indexzero avatar Nov 19 '15 13:11 indexzero

@janober cool to know about http://schema.link.fish/downloads/all.json !

Without any official json sources for schema.org do you intend to keep this one around as the best alternative?

0xgeert avatar Dec 11 '15 13:12 0xgeert

Yes that is the idea. I need the data and so would have to parse it regularly anyway so it is there to stay.

janober avatar Dec 11 '15 15:12 janober

@janober That's good to know thanks.

Any reason why datatypes isn't avail on the file?

0xgeert avatar Dec 12 '15 11:12 0xgeert

@gebrits Sorry there was something wrong in the parse script. Did fix it and now also the datatypes are back.

janober avatar Dec 14 '15 12:12 janober

cheers @janober :+1:

0xgeert avatar Dec 14 '15 23:12 0xgeert

hi @janober Sorry to bother but supertypes-property seems to be broken on a couple of instances.

More specifically any subtype of CreativeWork seems to have a supertypes = [] , while they should list supertypes = ["CreativeWork"] instead.

0xgeert avatar Dec 15 '15 16:12 0xgeert

Ah yes thanks, you are right. The new page-structure is really not great for extracting information. However I made some changes and all the ones I checked seem now to be fine.

Please tell me if you find any other issues.

janober avatar Dec 16 '15 08:12 janober

Are you using the released schema.org version (at http://schema.org) or some develop-branch?

I'm asking because the latest json misses a couple of properties which are avail on the released schema.org. This is not exhaustive (since I'm only checking for the ones we're using), but at least these properties are missing:

branchCode,containedInPlace,containsPlace,screenCount,iataCode,icaoCode,character,commentCount,hasPart,license,countryOfOrigin,composer,iswcCode,lyricist,recordedAs,isrcCode,recordingOf,catalogNumber,creditedTo,recordLabel,releaseOf,containsSeason,dissolutionDate,parentOrganization,sport,athlete,coach

0xgeert avatar Dec 16 '15 13:12 0xgeert

Ehm, all of them can be found under "properties" (at least the first few and the last one, the other ones I did not check because I also expect them to be there)

janober avatar Dec 16 '15 14:12 janober

Hmm sorry about that. Sublime apparently choked on the file. All good now

0xgeert avatar Dec 16 '15 14:12 0xgeert

No problem, great to hear ;-)

janober avatar Dec 16 '15 14:12 janober

@jaygray0919 Saw in the other thread that you were wondering how the file gets generated.

The original scrape-script can be found here: https://github.com/mhausenblas/schema-org-rdf sadly did it not work anymore because the website changed. So all I did is to fix it. The fixed version can be found in my fork here: https://github.com/janober/schema-org-rdf

That script got originally used by this website: http://schema.rdfs.org

However they stopped supporting it and for that reason is file they offer for download very old. So because we need the data for our site link.fish anyway I simply decided to take over for schema.rdfs.org and offer the scraped schema.org data for download.

Everything is also described on: http://schema.link.fish

janober avatar Dec 28 '15 17:12 janober

Apologies for being pedantic but, do either of the mentioned JSON extracts conform to JSON-Schema (http://json-schema.org)?

sixman9 avatar Jun 22 '16 02:06 sixman9