oltarasenko
As discussed in https://github.com/oltarasenko/crawly/pull/97#issuecomment-626565242, we want to build a lightweight (probably HTTP-based) UI for single-node Crawly operations. For people who don't want (or don't...
As discussed in https://github.com/oltarasenko/crawly/pull/165#discussion_r548901091, it's better to have mocks of a separate module with a test spider
One of the problems I am constantly seeing is a need to extract new URLs. And I am looking for a way to simplify it for me and other people...
A lot of my crawls depend on proper user-agent strings. It's a bit hard to supply user agents using a config as we're doing now. It would be good to...
Looks like the problem is that this website does not have a form element. Do you think there is a way to handle such cases? Traceback (most recent call last): File...
Creates a splash replacement. I have tested it with just one target so far, it's hard to say it's perfect, but it might be an alternative to splash that is...
Hey, I don't see how remarketing can work if 3rd-party cookies are deprecated. Could someone explain?
Hey @tsloughter, I was wondering whether you could make the release? I think I need your latest changes for OTP23. Ironically, my current project requires epmdless :)
Spiders should send a stop message to the UI, in order to avoid displaying 'node_down', which does not reflect reality
An example is here: http://crawlyui.com/logs/267/list The count estimate for this query shows 320. The actual number of rows in the table is 4. As a result, we have long pagination...