python-goose
python-goose copied to clipboard
Bad case for image extraction
Hi, I use goose to extract images from a Chinese news site. Some news articles dont't have images. But goose gives me one from the sidebar of the page.
For example: url: http://news.xinhuanet.com/fortune/2013-10/10/c_125507992.htm
And Goose give me this image: http://news.xinhuanet.com/fortune/titlepic/117523073_title1n.jpg
This image is at the right site of this page.
How can I fix it!
Thanks for your help.
This bug is fixed.