data_tooling

Tools for managing datasets for governance and training.

Results: 100 data_tooling issues

- uid: uit_viquad - type: processed - description: - name: UIT-ViQuAD – A Vietnamese Dataset for Evaluating Machine Reading Comprehension. - description: Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset...

data catalog
need custodian permission

- uid: titml_idn_speech_corpus - type: processed - description: - name: TITML-IDN speech corpus - description: TITML-IDN contains Bahasa Indonesia speech data from 20 Indonesian speakers, 9. Each speaker was asked...

data catalog
need custodian permission

- uid: tirto_id - type: primary - description: - name: tirto.id - description: Tirto.id is a news, article, opinion, and infographic website in Indonesia. First broadcast in February 2016 and...

data catalog
need custodian permission

- uid: (UIT-ViOCD) - type: processed - description: - name: Vietnamese Complaint Detection on E-Commerce Websites - description: Customer product reviews play a role in improving the quality of products...

data catalog
need custodian permission

- uid: aaj_tak - type: primary - description: - name: Aaj Tak - description: Hindi news production - homepage: https://www.aajtak.in/ - validated: True - languages: - language_names: - Indic -...

data catalog
need custodian permission

- uid: editora_trinta_nove_mozambique - type: primary - description: - name: Editora Trinta Nove Mozambique - description: - homepage: https://www.africanbookscollective.com/publishers/editora-trinta-zero-nove - validated: True - languages: - language_names: - Niger-Congo - language_comments:...

data catalog
need custodian permission

- uid: languages_of_mozambique_lidemo - type: primary - description: - name: languages of mozambique lidemo - description: LIDEMO.NET holds an electronic library of publications in Mozambican languages. - homepage: https://lidemo.net/ -...

invalid
data catalog
need data sourcing feedback

- uid: australian_twittersphere - type: processed - description: - name: Australian Twittersphere - description: The Australian Twittersphere is a longitudinal, curated collection of tweets from approximately 838,000 Twitter accounts identified...

data catalog
need custodian permission

- uid: UIT-ViHSD - type: processed - description: - name: Vietnamese Hate Speech Detection Dataset - description: In recent years, Vietnam witnesses the mass development of social network users on...

data catalog
need custodian permission

- uid: neliti - type: primary - description: - name: Neliti - Indonesian research repository - description: Neliti is a research repository of Indonesian journal articles, books, research reports, policy...

data catalog
need custodian permission