ImageScraper
:scissors: High performance, multi-threaded image scraper
If I wanted to automate downloading images from a user's social media account (e.g. Twitter, Facebook, Instagram), how could it be done? Can options to "sign...
Galleries now often have GIFs replaced by gifv/WebM files. Could you consider adding a WebM download feature to ImageScraper?
Hello, when trying to install ImageScraper on Windows 7 with Python 3.5, I get: Command "c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='C:\\Users\\grele\\AppData\\Local\\Temp\\pip-build-3n1hjoe9\\setproctitle\\setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record C:\Users\grele\AppData\Local\Temp\pip-do0m1tx9-record\install-record.txt --single-version-externally-managed...
I executed: image-scraper "https://www.google.nl/search?biw=1920&bih=925&tbm=isch&sa=1&ei=L0aJWsWDJsjPwQKprY6gCA&q=error&oq=error&gs_l=psy-ab.3..0i67k1l2j0j0i67k1j0l4j0i67k1j0.8488.9037.0.10277.5.5.0.0.0.0.78.255.5.5.0....0...1c.1.64.psy-ab..0.5.252....0.TZSBrsxg2oo" --injected And got: ImageScraper ============ Requesting page.... c:\program files\python36\lib\site-packages\selenium\webdriver\phantomjs\webdriver.py:49: UserWarning: Selenium support for PhantomJS has been deprecated, please use headless versions of Chrome or Firefox instead...
I'm trying to scrape a large number of photos where all originals are between 300-800 px. Because they are on the site at one dimension, it downloads all of them...
While installing ImageScraper on Windows 10, I got the following error: Installing collected packages: setproctitle Running setup.py install for setproctitle: started Running setup.py install for setproctitle:...
Cut and paste code from @kennedyshead
Installed ImageScraper with pip and pointed it to https://www.wikiart.org/en/recently-added-artworks and the response was: C:\Users\Think\VM\aml-1.6\dev>image-scraper -s C:\\Users\\Think\\VM\\watest\\images https://www.wikiart.org/en/recently-added-artworks ImageScraper ============ Requesting page.... Sorry, no images found. Do you know what the...
```python
css = tree.xpath("//link[@type='text/css']/@href")
css_images = list()
for css_file in css:
    if not re.match(r'^[a-zA-Z]+://', css_file):
        css_file = self.url + css_file
    image_list = re.findall('url\(([^)]+)\)', requests.get(css_file).content.decode('utf-8'))
    for image in image_list:
        if image.startswith('//'):...
```