Add dataset: WWI_documents_dataset

Open Skorkmaz88 opened this issue 2 years ago • 2 comments

A URL for this dataset

https://rdf.muninn-project.org/

Dataset description

This dataset covers WWI archives, specifically the documents subcategory of the store above; the data is in linked-data format. I am currently prototyping a converter to tabular format that consumes the archive's SPARQL endpoint. The final output will contain, for each entry: name, label, primary topic of the document, scanned images of first_page and last_page (I plan to omit any other pages, if available), access rights, and origin country.

The dataset could be used for classifying old documents, WWI documents in particular, for example looking at a document and classifying it as an attestation paper of Canadian origin.
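
As a rough illustration of the tabular conversion described above, here is a minimal sketch that flattens a response in the standard SPARQL 1.1 JSON results format into CSV rows. It does not query the Muninn endpoint; the variable names (`name`, `label`, `primary_topic`, and so on) and the sample values are assumptions based on the field list above, not the archive's actual schema or the converter being prototyped.

```python
import csv
import io

# Hypothetical column names, mirroring the fields listed in the description.
COLUMNS = [
    "name", "label", "primary_topic",
    "first_page", "last_page",
    "access_rights", "origin_country",
]

# Placeholder response in the SPARQL 1.1 JSON results layout.
# The values are made up for illustration and are not real archive entries.
SAMPLE = {
    "head": {"vars": COLUMNS},
    "results": {
        "bindings": [
            {
                "name": {"type": "literal", "value": "doc-1"},
                "label": {"type": "literal", "value": "Attestation paper"},
                "first_page": {"type": "uri", "value": "https://example.org/doc-1/first.jpg"},
                "origin_country": {"type": "literal", "value": "Canada"},
            }
        ]
    },
}


def bindings_to_rows(sparql_json, columns=COLUMNS):
    """Flatten SPARQL JSON bindings into a list of dicts, one per entry."""
    rows = []
    for binding in sparql_json["results"]["bindings"]:
        # Unbound (e.g. OPTIONAL) variables are absent from a binding,
        # so they fall back to an empty string.
        rows.append({col: binding.get(col, {}).get("value", "") for col in columns})
    return rows


def rows_to_csv(rows, columns=COLUMNS):
    """Serialize the flattened rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(bindings_to_rows(SAMPLE)))
```

Keeping every column present in each row, even when a value is missing, gives a fixed schema, which is usually easier to load as a dataset than rows of varying width.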

Dataset modality

Mixed

Dataset licence

Other license

Other licence

Each item will have its own associated license

How can you access this data

Via an open API

Confirm the dataset has an open licence

  • [X] To the best of my knowledge, this dataset is accessible via an open licence

Contact details for data custodian

No response

Skorkmaz88 avatar Jul 13 '22 18:07 Skorkmaz88

This sounds great, thanks for suggesting it! If you also want to work on adding this, feel free to use the #self-assign command to assign yourself to the issue.

davanstrien avatar Jul 13 '22 18:07 davanstrien

#self-assign

Skorkmaz88 avatar Jul 13 '22 18:07 Skorkmaz88