Chinese-Word-Vectors

How can I get a corpus like 'Financial News 金融新闻'?

Open • SeekPoint opened this issue 6 years ago • 1 comment

SeekPoint, Oct 13 '18 14:10

The corpora that can be downloaded are listed in the Corpus section. The rest were crawled from the Internet with Scrapy. Because of copyright, releasing them could expose us to legal risk, so I'm very sorry that we can't share them. With a simple Scrapy crawler, though, it is easy to collect a large amount of data in a few days.

shenshen-hungry, Oct 15 '18 03:10
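
For readers wondering what such a crawler might look like, here is a minimal sketch. It is not the maintainers' crawler: the reply mentions Scrapy, while this version uses only the Python standard library so that it runs without extra installs. The names `ParagraphExtractor`, `extract_paragraphs`, and `fetch_paragraphs`, the user-agent string, and the choice to keep only `<p>` text are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class ParagraphExtractor(HTMLParser):
    """Collect the text of every <p> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._in_paragraph = False
        self._buffer = []
        self.paragraphs = []

    def _flush(self):
        # Store the buffered text as one paragraph, skipping empty ones.
        text = "".join(self._buffer).strip()
        if text:
            self.paragraphs.append(text)
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            # A new <p> implicitly closes an unclosed previous one.
            self._flush()
            self._in_paragraph = True

    def handle_endtag(self, tag):
        if tag == "p":
            self._flush()
            self._in_paragraph = False

    def handle_data(self, data):
        if self._in_paragraph:
            self._buffer.append(data)


def extract_paragraphs(html):
    """Return the list of paragraph texts found in an HTML string."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return parser.paragraphs


def fetch_paragraphs(url, timeout=10.0):
    """Download one page and return its paragraph texts.

    Assumption: the page declares its charset in the HTTP headers;
    otherwise UTF-8 is used as a fallback.
    """
    request = Request(url, headers={"User-Agent": "corpus-sketch/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_paragraphs(html)
```

Calling `fetch_paragraphs` on each article URL of a news site and writing the results one paragraph per line would give raw text ready for word segmentation. A real crawl would also need link discovery, rate limiting, and deduplication, which is where a framework such as Scrapy helps.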