data_tooling
Create dataset wikihow_vietnamese_human_instructions
- uid: wikihow_vietnamese_human_instructions
- type: processed
- description:
- name: wikiHow Vietnamese Human Instructions
- description: Step-by-step instructions in Vietnamese extracted from wikiHow and decomposed into a formal graph representation in RDF. For any queries and requests, contact Paolo Pareti. To cite this dataset, use: Paula Chocron, Paolo Pareti. Vocabulary Alignment for Collaborative Agents: a Study with Real-World Multilingual How-to Instructions.
- homepage: https://www.kaggle.com/paolop/human-instructions-vietnamese-wikihow
- validated: True
- languages:
- language_names:
- Vietnamese
- language_comments:
- language_locations:
- Asia
- Vietnam
- validated: False
- language_names:
- custodian:
- name: Paolo Pareti
- in_catalogue:
- type: A university or research institution
- location:
- contact_name: Paolo Pareti
- contact_email: [email protected]
- contact_submitter: False
- additional: https://w3id.org/people/paolo
- validated: False
- availability:
- procurement:
- for_download: Yes - it has a direct download link or links
- download_url: https://www.kaggle.com/paolop/human-instructions-vietnamese-wikihow
- download_email:
- licensing:
- has_licenses: Yes
- license_text: CC BY-NC-SA 4.0
- license_properties:
- license_list:
- cc-by-nc-4.0: Creative Commons Attribution Non Commercial 4.0 International
- pii:
- has_pii: Yes
- generic_pii_likely: somewhat likely
- generic_pii_list:
- names
- website account name or handle
- URLs
- numeric_pii_likely: somewhat likely
- numeric_pii_list:
- sensitive_pii_likely: somewhat likely
- sensitive_pii_list:
- no_pii_justification_class:
- no_pii_justification_text:
- validated: False
- procurement:
- processed_from_primary:
- from_primary: Taken from primary source
- primary_availability: Yes - their documentation/homepage/description is available
- primary_license: Yes - the dataset has the same license as the source material
- primary_types:
- web | wiki
- validated: False
- from_primary_entries:
- media:
- category:
- text
- text_format:
- other
- RDF
- audiovisual_format:
- image_format:
- database_format:
- .ZIP
- text_is_transcribed: No
- instance_type: Sentences / instructions
- instance_count: 1K<n<10K
- instance_size: 10<n<100
- validated: False
- category:
- fname: wikihow_vietnamese_human_instructions.json
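A minimal sketch of how an entry like this might be read once it is stored as JSON. The nesting used here is an assumption: the listing above is flat, so which keys sit under `description`, `availability`, or `media` is inferred from their order and is not a confirmed schema. Only a few fields are reproduced, and the `summarize` helper is hypothetical.

```python
import json

# Assumed nesting for a subset of the fields listed above. The real file
# (wikihow_vietnamese_human_instructions.json) may be organised differently.
ENTRY_JSON = """
{
  "uid": "wikihow_vietnamese_human_instructions",
  "type": "processed",
  "description": {
    "name": "wikiHow Vietnamese Human Instructions",
    "homepage": "https://www.kaggle.com/paolop/human-instructions-vietnamese-wikihow"
  },
  "languages": {"language_names": ["Vietnamese"]},
  "availability": {
    "licensing": {"has_licenses": "Yes", "license_text": "CC BY-NC-SA 4.0"},
    "pii": {
      "has_pii": "Yes",
      "generic_pii_list": ["names", "website account name or handle", "URLs"]
    }
  },
  "media": {"category": ["text"], "instance_count": "1K<n<10K"}
}
"""


def summarize(entry: dict) -> dict:
    """Collect the fields a reviewer would typically check first (hypothetical helper)."""
    licensing = entry["availability"]["licensing"]
    pii = entry["availability"]["pii"]
    return {
        "uid": entry["uid"],
        "languages": entry["languages"]["language_names"],
        "license": licensing["license_text"],
        # The form stores yes/no answers as strings, so convert to a bool here.
        "has_pii": pii["has_pii"] == "Yes",
        "instance_count": entry["media"]["instance_count"],
    }


entry = json.loads(ENTRY_JSON)
summary = summarize(entry)
print(summary)
```

Converting the "Yes"/"No" strings to booleans at read time keeps later checks simple, while the stored record stays identical to what the form produced.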