Jan Kinne

Results 4 comments of Jan Kinne
trafficstars

Hi both sites seem to rely quite heavily on java script which may cause the problem. Another issue may be that the text is not enclosed by html tags ARGUS...

Could be because of JavaScript or some kind of delayed loading. Not sure to be honest. How frequent is that issue in your dataset?

Yeah, you can change any scrapy related settings in the settings file: https://github.com/datawizard1337/ARGUS/blob/4f61679595f305d3587caaedb030a1884c2f422e/build/lib/ARGUS/settings.py Check out https://docs.scrapy.org/en/latest/topics/settings.html for more info. And don't forget to deploy your changed project as described above.

My knowledge about splash is very limited. I think the implementation is not straight forward, especially if you want to combine it with ARGUS. Sorry!