cnn-dailymail
cnn-dailymail copied to clipboard
Titles of articles
Hi,
In the *.story files the titles of the news articles are absent. Is there a way to get the titles?
url_list contains all orginal link. you can get all link from there. hash code of *.story is generated from url. Example: https://www.browserling.com/tools/text-to-hex become this. 000efdbb001fd19666b37456e239c78c52908655
Try my repository and make it run: https://github.com/abisee/cnn-dailymail#option-1-download-the-processed-data