proose
Image fetching with proose
Hi Matt, thanks for the wonderful library. I am using it as a major component in my latest project. It works fine when extracting the title, text, keywords, etc. However, I have a problem with image fetching. Like the native goose, proose can also extract an image, which is good. But when I extract multiple URLs in succession, the image URL returned in the output is always the first one, even though the other data appears correct.
Is there a configuration change or code adjustment I need to make to overcome this issue? Please provide a solution if possible, or otherwise a direction for solving it. Any help is greatly appreciated.
It seems you are using goose 1.4. Is that the cause of the problem? The latest stable version is 2.1.3.
Hi Matt,
We even tried replacing the goose jar file with the latest version (2.0.2), but that did not work either. After replacing the jar file, we keep getting the error "Function importClass must be called with a class; had "[JavaPackage com.gravity.goose.Configuration]" instead.", even though "com/gravity/goose/Configuration.class" exists in the new jar.
At this point, we are completely stuck. I guess we do not know enough about Prudence or Goose to make the fix.
I would appreciate it if you could guide us on how to fix image fetching with either the goose version (1.4.1) packaged with proose or the latest goose version (2.0.2).
Thanks, Rajiv