masader
masader copied to clipboard
The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.
There are many datasets that only provide partial data i.e either samples only or other forms like ids in twitter datasets.
Comments from @nizarhabash1 * The subset field is a bit confusing and comes too early * The paper information should be earlier
We want to create a page to highlight what were the main updates in this version.
Rather than having a simple text like this  Having a more unique styling of the landing page's header like this (not black and white)  On mobile (not black...
Add Paper Link
Add Paper Link
**Describe the dataset error** Hi, I was checking datasets on the great Masader site and found that two datasets are the exact duplicates, and unfortunately, the download link on the...