data_tooling icon indicating copy to clipboard operation
data_tooling copied to clipboard

Create dataset wikihow_vietnamese_human_instructions

Open albertvillanova opened this issue 2 years ago • 2 comments

  • uid: wikihow_vietnamese_human_instructions
  • type: processed
  • description:
    • name: wikiHow Vietnamese Human Instructions
    • description: Step-by-step instructions in Vietnamese extracted from wikiHow and decomposed into a formal graph representation in RDF. For any queries and requests contact: Paolo Pareti To cite this dataset use: Paula Chocron, Paolo Pareti. Vocabulary Alignment for Collaborative Agents: a Study with Real-World Multilingual How-to Instructions. (PDF) (bibtex)
    • homepage: https://www.kaggle.com/paolop/human-instructions-vietnamese-wikihow
    • validated: True
  • languages:
    • language_names:
      • Vietnamese
    • language_comments:
    • language_locations:
      • Asia
      • Vietnam
    • validated: False
  • custodian:
    • name: Paolo Pareti
    • in_catalogue:
    • type: A university or research institution
    • location:
    • contact_name: Paolo Pareti
    • contact_email: [email protected]
    • contact_submitter: False
    • additional: https://w3id.org/people/paolo
    • validated: False
  • availability:
    • procurement:
      • for_download: Yes - it has a direct download link or links
      • download_url: https://www.kaggle.com/paolop/human-instructions-vietnamese-wikihow
      • download_email:
    • licensing:
      • has_licenses: Yes
      • license_text: CC BY-NC-SA 4.0
      • license_properties:
      • license_list:
        • cc-by-nc-4.0: Creative Commons Attribution Non Commercial 4.0 International
    • pii:
      • has_pii: Yes
      • generic_pii_likely: somewhat likely
      • generic_pii_list:
        • names
        • website account name or handle
        • URLs
      • numeric_pii_likely: somewhat likely
      • numeric_pii_list:
      • sensitive_pii_likely: somewhat likely
      • sensitive_pii_list:
      • no_pii_justification_class:
      • no_pii_justification_text:
    • validated: False
  • processed_from_primary:
    • from_primary: Taken from primary source
    • primary_availability: Yes - their documentation/homepage/description is available
    • primary_license: Yes - the dataset has the same license as the source material
    • primary_types:
      • web | wiki
    • validated: False
    • from_primary_entries:
  • media:
    • category:
      • text
    • text_format:
      • other
      • RDF
    • audiovisual_format:
    • image_format:
    • database_format:
      • .ZIP
    • text_is_transcribed: No
    • instance_type: Sentences / instructions
    • instance_count: 1K<n<10K
    • instance_size: 10<n<100
    • validated: False
  • fname: wikihow_vietnamese_human_instructions.json

albertvillanova avatar Jan 19 '22 08:01 albertvillanova