data_tooling icon indicating copy to clipboard operation
data_tooling copied to clipboard

Tools for managing datasets for governance and training.

Results 100 data_tooling issues
Sort by recently updated
recently updated
newest added

- uid: 51_hani_stories_in_both_hani_and_english - type: primary - description: - name: 51 Hani Stories in both Hani and English - description: Parallel corpus. Published printed book collecting 51 folk tales, with...

data catalog
need custodian permission
need data sourcing feedback

- uid: vnexpress - type: primary - description: - name: VnExpress - description: VnExpress is a Vietnamese online newspaper, run by FPT Corporation. It is the first newspaper in Vietnam...

data catalog

- uid: radio_publique_africaine - type: primary - description: - name: Radio Publique Africaine - description: African Public Radio (Radio Publique Africaine or RPA) is a public radio station in Burundi....

data catalog
need custodian permission

- uid: verdad_abierta - type: primary - description: - name: Verdad Abierta - description: Medio periodistico independiente de noticias - homepage: https://verdadabierta.com/ - validated: True - languages: - language_names: -...

data catalog

- uid: oer_commons - type: primary - description: - name: OER Commons - description: OER Commons (OER for open educational resources) is a freely accessible online library that allows teachers...

data catalog

- uid: newspaper_in_basque - type: primary - description: - name: newspaper in Basque of a region - description: It is a Basque language magazine published by the Goiena Communication Group...

data catalog

- uid: global_voices_arabic - type: primary - description: - name: Global Voices Arabic - description: Global Voices pages in Arabic - homepage: https://ar.globalvoices.org/ - validated: True - languages: - language_names:...

data catalog

- uid: oarso_bidasoko_hitza - type: primary - description: - name: Oarso Bidasoko Hitza - description: Monolingual newspaper in Basque that compiles town level news of the area of Bidasoa (towns...

data catalog
need custodian permission

- uid: ekaia_ehuko_zientzia_eta_teknologia_aldizkaria - type: primary - description: - name: Ekaia EHUko Zientzia eta Teknologia aldizkaria - description: Ekaia is a scientific journal created in 1989. It publishes original works...

data catalog
need custodian permission

- uid: iarpa_babel_swahili_language_pack - type: processed - description: - name: IARPA Babel Swahili Language Pack - description: Swahili ASR Dataset. Official description says "IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d was...

data catalog
need custodian permission
need data sourcing feedback