
Cannot read property 'title' of undefined

Open AndreasMadsen opened this issue 11 years ago • 0 comments

My crawler (it crawls all sorts of websites) fails quite often on rt.com. This is what my log reports:

2014-03-14 04:15:30.393   3
what:   article crawler
action: task execution failed
error:  
    message: Cannot read property 'title' of undefined
    name:    TypeError
    stack:   |
        TypeError: Cannot read property 'title' of undefined
            at Fase2Algortime._reduceImage (/home/ubuntu/code/newlysis/node_modules/article/lib/fase-2/all.js:97:48)
            at Fase2Algortime.reduce (/home/ubuntu/code/newlysis/node_modules/article/lib/fase-2/all.js:188:8)
            at Article._end (/home/ubuntu/code/newlysis/node_modules/article/lib/article.js:51:17)
            at Object.TreeBuilder._parser.Parser.onend (/home/ubuntu/code/newlysis/node_modules/article/lib/treebuilder.js:52:12)
            at Parser.onend (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Parser.js:277:32)
            at Tokenizer.end (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Tokenizer.js:813:12)
            at Parser.end (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Parser.js:306:18)
            at Article.<anonymous> (/home/ubuntu/code/newlysis/node_modules/article/lib/treebuilder.js:64:18)
            at Article.g (events.js:180:16)
            at Article.EventEmitter.emit (events.js:92:17)
href:   |
    http://rt.com/news/merkel-ukraine-answer-sanctions-658/
2014-03-14 04:15:30.235   3
what:   article crawler
action: task execution failed
error:  
    message: Cannot read property 'title' of undefined
    name:    TypeError
    stack:   |
        TypeError: Cannot read property 'title' of undefined
            at Fase2Algortime._reduceImage (/home/ubuntu/code/newlysis/node_modules/article/lib/fase-2/all.js:97:48)
            at Fase2Algortime.reduce (/home/ubuntu/code/newlysis/node_modules/article/lib/fase-2/all.js:188:8)
            at Article._end (/home/ubuntu/code/newlysis/node_modules/article/lib/article.js:51:17)
            at Object.TreeBuilder._parser.Parser.onend (/home/ubuntu/code/newlysis/node_modules/article/lib/treebuilder.js:52:12)
            at Parser.onend (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Parser.js:277:32)
            at Tokenizer.end (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Tokenizer.js:813:12)
            at Parser.end (/home/ubuntu/code/newlysis/node_modules/article/node_modules/htmlparser2/lib/Parser.js:306:18)
            at Article.<anonymous> (/home/ubuntu/code/newlysis/node_modules/article/lib/treebuilder.js:64:18)
            at Article.g (events.js:180:16)
            at Article.EventEmitter.emit (events.js:92:17)
href:   |
    http://rt.com/news/chinese-jet-cyber-espionage-stolen-718/
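Both traces end in `_reduceImage` reading `.title` from a value that is `undefined`. I have not confirmed what `all.js:97` actually does, so the following is only a minimal sketch of the failure mode and one possible guard, assuming the undefined value is a missing image candidate. The names `safeImageTitle`, `unsafeImageTitle` and `images` are hypothetical and do not come from the module.

```javascript
// Hypothetical stand-in for the failing access (NOT the module's real code):
// reading .title from an undefined candidate throws the TypeError in the log.
function unsafeImageTitle(candidate) {
  return candidate.title;
}

// Hypothetical guarded variant: return a fallback when the candidate is
// missing or carries no string title, instead of throwing.
function safeImageTitle(candidate, fallback) {
  if (candidate === undefined || candidate === null) {
    return fallback;
  }
  return typeof candidate.title === 'string' ? candidate.title : fallback;
}

// Assumption: the page yielded no usable image candidate, so the list is empty.
const images = [];

let thrown = null;
try {
  unsafeImageTitle(images[0]);
} catch (err) {
  thrown = err;
}

console.log(thrown instanceof TypeError);      // true
console.log(safeImageTitle(images[0], null));  // null
console.log(safeImageTitle({ title: 'Example' }, null)); // Example
```

Whether falling back silently is the right behaviour, or whether the missing candidate points at a bug earlier in the pipeline, would need a look at the actual HTML from the two URLs above.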

AndreasMadsen, Mar 30 '14 09:03