Hands-on-WebScraping issues

No tweets are being scraped.

5

Hashtags are found, but it doesn`t find any tweets. I have lowerd the setting (delay and concurrency) and set ROBOTSTXT_OBEY to false. Any tips?

iprelic

HTTP Status Code Is Not Handled Or Not Allowed

1

Uh oh...did Twitter break us? Do we have the change the user_agent in settings.py?

Huntley30

scrapy list command doesnt work

2

all the requirements were successfully installed but 'scrapy list' command didnt work giving the error "'scrapy' is not recognized as an internal or external command, operable program or batch file."...

Muhammad406

Isn't Working?

This scraper had been working for me until today. Have anyone had the same problem or is only happening to me? Thank you very much

terumask

2020-09-17 22:42:08 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <404 https://mobile.twitter.com/hashtag/>: HTTP status code is not handled or not allowed

ADITYA727

pip install -r requirements.txt --user issue

I believe the correct dependency name is python-dateutil, not dateutil.

saliudev

unable to import get_links

` $ scrapy list Traceback (most recent call last): File "/home/iseadmin/anaconda3/bin/scrapy", line 10, in sys.exit(execute()) File "/home/iseadmin/anaconda3/lib/python3.6/site-packages/scrapy/cmdline.py", line 142, in execute cmd.crawler_process = CrawlerProcess(settings) File "/home/iseadmin/anaconda3/lib/python3.6/site-packages/scrapy/crawler.py", line 280, in __init__...

vahuja4

crawling by time periods

Hi Amit, great cralwer!! well done :) Is it possible to add to the crawler the ability to crawl specific periods? right now, its running perfectly and crawl mostly 2020....

chupit

Pulling All Tweets

5

Hey, quick question. When I ran this using that hashtag, BigData, it pulled all tweets containing the words data or big data. Why is it not only pulling tweets with...

kozakalec

Issues installing libraries in python 3.8

1

Initially, I got an error for dateutil not having a valid version for my current install, then I read that python-dateutil should be a subsitute, but when installing that I...

adalast

Hands-on-WebScraping
Hands-on-WebScraping copied to clipboard

Metadata

No tweets are being scraped.

HTTP Status Code Is Not Handled Or Not Allowed

scrapy list command doesnt work

Isn't Working?

2020-09-17 22:42:08 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <404 https://mobile.twitter.com/hashtag/>: HTTP status code is not handled or not allowed

pip install -r requirements.txt --user issue

unable to import get_links

crawling by time periods

Pulling All Tweets

Issues installing libraries in python 3.8

← Metadata

Owner

Metadata

Hands-on-WebScraping Hands-on-WebScraping copied to clipboard

Metadata

← Metadata

Owner

Metadata

Hands-on-WebScraping
Hands-on-WebScraping copied to clipboard