Sơn Trần-Nguyễn

Results 50 issues of Sơn Trần-Nguyễn

So target server can send 304 if applicable.

One run can have duplicate URLs (a product page can belong in multiple categories). Need to avoid re-scraping them.

During scraping, sometimes the following exception is thrown: ``` ** (BadFunctionError) expected a function, got: %Scrapex.GenSpider.Request{pid: nil, ref: nil, url: ""} 2015-11-24T04:14:25.182758138Z app[web.1]: (elixir) lib/enum.ex:1959: Enum.do_find/3 2015-11-24T04:14:25.182873123Z app[web.1]: lib/scrapex/gen_spider.ex:610: Scrapex.GenSpider.handle_info/2...

bug

Right now, even though `Mod.parse/2` can return a (list of) `GenSpider.Request`, `GenSpider` still `await` for it, thus blocking the callee. Need to handle the response message separately when it comes...

Provide a CLI application to use configuration in a target directory, including `GenSpider` callback module, and output data to files.

We already have a sample spider in test and a StackOverflow spider in `GenSpider`'s module doc, but we want a more advanced spider to use all features.

Handle cases such as 500, 403, etc..

#### What is your current rclone version (output from rclone version)? rclone v1.65.1 - os/version: darwin 14.4.1 (64 bit) - os/kernel: 23.4.0 (x86_64) - os/type: darwin - os/arch: amd64 -...

Hello, Our team were interested in running an SSH client on Cloudflare Workers, and `ssheasy` seems to be a great fit. However, we hit a snag of not being able...

## Steps to Reproduce For all purposes, below is the example from the homepage, with a slight modification for the 7th and 8th slides to have width of 75% instead...