Farsi-datasets icon indicating copy to clipboard operation
Farsi-datasets copied to clipboard

A collection of Farsi (Persian) datasets

This repository contains Farsi (Persian) datasets for Machine Learning tasks, particularly NLP.

The purpose of this repository is to share the Farsi datasets for whoever is interested in doing Farsi NLP.

  1. Farsi Wiki Dataset
  2. Farsi News datasets

Contributions

If you are interested in contributing, and have Farsi datasets that would like to share with the Farsi NLP community:

  • You can upload them directly and submit a pull request (PR).
  • You can share your GitHub repository and send me the link of your repo.
  • You send me an email at [email protected].