colly
Elegant Scraper and Crawler Framework for Golang
Hi there, in my case I have 500k URLs that I'm going to crawl with gocolly. Is there a way to limit the number of active goroutines somehow? I call `.Visit`...
Hello, I also read [#445](https://github.com/gocolly/colly/issues/445) and [#438](https://github.com/gocolly/colly/issues/438), but I can't read the Request.body string. I tried to put it in the Context, but that does not work. I looked up a lot of information, the 'ioutil.ReadAll' is...
While trying the code below I only get a single link, although the website is full of links: `Visiting http://teenage.engineering`. What is happening? Thanks for any hints. ``` package main import...
I'm attempting to extract/locate JavaScript code within an HTML page; whilst Colly is not a headless browser, and hence JS execution is not a feature, I don't actually need to...
Hello, I use go-colly to crawl through a website with pagination. I set async to true and parallelism to 5. I also set the timeout to 1 minute. Though...
I made a scraper for Amazon product titles, but Amazon's captcha catches my scraper. I tried 10 times with `go run main.go` (8 times it caught me, 2 times I scraped...
Hello, First, great package! I'm trying to make a scraper to get, from various websites, some kind of images. It works great, I have basically a CLI and a go...
The new Cascadia version includes: - Case-insensitive CSS attribute selectors with `i` (https://github.com/andybalholm/cascadia/pull/51) - Support for many pseudo-classes (https://github.com/andybalholm/cascadia/issues/50) Goquery has recently upgraded Cascadia too.
I am trying to log in to my Amazon buyer account to get tracking info. I got a WordPress/WooCommerce login working and retrieved the info, but I could not do the same for Amazon. ``` package main import...
Using https://github.com/prologic/go-gopher