machine-learning-dataset topic
nlp-public-dataset
Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集
spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
Machine-Learning-Problems-DataSets
We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit...
ClaMP
A Malware classifier dataset built with header fields’ values of Portable Executable files
2DGeometricShapesGenerator
2D Geometric shapes generator
jazznet
jazznet dataset of piano patterns for music audio machine learning research