cnn-dailymail icon indicating copy to clipboard operation
cnn-dailymail copied to clipboard

Titles of articles

Open aburkov opened this issue 7 years ago • 2 comments

Hi,

In the *.story files the titles of the news articles are absent. Is there a way to get the titles?

aburkov avatar Feb 10 '18 04:02 aburkov

url_list contains all orginal link. you can get all link from there. hash code of *.story is generated from url. Example: https://www.browserling.com/tools/text-to-hex become this. 000efdbb001fd19666b37456e239c78c52908655

the-black-knight-01 avatar Dec 03 '18 11:12 the-black-knight-01

Try my repository and make it run: https://github.com/abisee/cnn-dailymail#option-1-download-the-processed-data

JafferWilson avatar Dec 03 '18 12:12 JafferWilson