gallery-dl

[Instagram] Avoid scrape warning

Open ashes-xda opened this issue 2 years ago • 9 comments

[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/feed/user/54655984401/'
[instagram][info] Use '-o cursor=2975840913239852315_54655984401' to continue downloading from the current position
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/feed/reels_media/'
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/highlights/5469402209/highlights_tray/'
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/clips/user/'
[6/6] https://www.instagram.com/xyz/
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/users/web_profile_info/'
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/users/web_profile_info/'
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/users/web_profile_info/'
[instagram][error] HttpError: '400 Bad Request' for 'https://www.instagram.com/api/v1/users/web_profile_info/'
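
The `[info]` line in the log above suggests resuming with `-o cursor=...`. A minimal sketch of pulling that cursor out of a saved log and building the resume command is below. The helper name `resume_command` and the idea of saving the log text first are assumptions for illustration, not part of gallery-dl.

```python
import re

# Matches the resume hint that gallery-dl printed in the log above.
CURSOR_HINT = re.compile(r"Use '-o cursor=([^']+)' to continue")


def resume_command(log_text, url):
    """Build a gallery-dl argument list that resumes from the last cursor hint.

    Returns None when the log contains no cursor hint.
    """
    cursors = CURSOR_HINT.findall(log_text)
    if not cursors:
        return None
    # Use the most recent hint, since later positions supersede earlier ones.
    return ["gallery-dl", "-o", f"cursor={cursors[-1]}", url]


log = (
    "[instagram][info] Use '-o cursor=2975840913239852315_54655984401' "
    "to continue downloading from the current position"
)
cmd = resume_command(log, "https://www.instagram.com/xyz/")
print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run` once the 400 errors have stopped; resuming does not by itself address the cause of the errors.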

I've been getting the scrape warning very often on IG lately, saying that they might permanently terminate my account (a secondary account used only for scraping). Is there any way to bypass this warning, or anything else I can do to avoid it?

ashes-xda avatar Dec 10 '23 18:12 ashes-xda

You could try setting the sleep and sleep-request options to something like 12 seconds and 2 seconds respectively.
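
As a rough sketch of that suggestion, the snippet below builds a config fragment with `sleep` and `sleep-request` set for the Instagram extractor only. The function name `sleep_config` is hypothetical, and where the fragment gets merged (for example a `config.json` in your gallery-dl config directory) depends on your setup.

```python
import json


def sleep_config(sleep=12.0, sleep_request=2.0):
    """Return a gallery-dl config fragment with delays for Instagram only.

    sleep: seconds to wait before each file download.
    sleep_request: seconds to wait between HTTP requests during extraction.
    """
    return {
        "extractor": {
            "instagram": {
                "sleep": sleep,
                "sleep-request": sleep_request,
            }
        }
    }


config = sleep_config()
# Print the fragment so it can be merged into an existing config file.
print(json.dumps(config, indent=4))
```

The same values can also be passed on the command line with `--sleep 12 --sleep-request 2`. Whether these particular numbers are enough to avoid the warning is not something this thread establishes.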

chocoagua avatar Dec 11 '23 01:12 chocoagua

You could try setting the sleep and sleep-request options to something like 12 seconds and 2 seconds respectively.

What happens if I choose not to use cookies? Will the download include all posts and reels but exclude stories or highlights? Or are there any limitations for downloading without cookies, such as potential lower resolution or incomplete downloads, even if the profile is public? @chocoagua @mikf

ashes-xda avatar Dec 11 '23 18:12 ashes-xda

You could try setting the sleep and sleep-request options to something like 12 seconds and 2 seconds respectively.

What happens if I choose not to use cookies? Will the download include all posts and reels but exclude stories or highlights? Or are there any limitations for downloading without cookies, such as potential lower resolution or incomplete downloads, even if the profile is public? @chocoagua @mikf

Instagram gives you lower-resolution media in a non-logged-in session. This is the case whether you are downloading media or just viewing it through the app/site.

danrynr avatar Dec 12 '23 11:12 danrynr

What happens if I choose not to use cookies?

IG will redirect all requests to its login page after a (very) short while.

[urllib3.connectionpool][debug] https://www.instagram.com:443 "GET /accounts/login/?next=/api/v1/media/1875629777499953996/info/ HTTP/1.1" 200 None
[instagram][error] HTTP redirect to login page (https://www.instagram.com/accounts/login/)

(As for your initial question: I don't know. Maybe consider using instaloader instead of gallery-dl)

mikf avatar Dec 12 '23 12:12 mikf

(As for your initial question: I don't know. Maybe consider using instaloader instead of gallery-dl)

Oh, I tried instaloader initially, but it was way too slow and kept giving a forbidden error every few minutes, whereas gallery-dl is super fast; there is no comparison. The only issue is a few errors these days, which I never faced before, and I've been scraping for almost 6 months. I'll try the sleep and sleep-request options and see how it goes. Thank you for such an awesome program.

ashes-xda avatar Dec 12 '23 18:12 ashes-xda

My accounts are being banned frequently even with sleep and sleep-request set, while using browser cookies and a VPN. What is your experience? Thanks.

dademiller360 avatar Feb 19 '24 09:02 dademiller360

A VPN does not help here; it likely makes things worse.

Hrxn avatar Feb 19 '24 12:02 Hrxn

True, but if Meta bans my ISP IP (static), it's going to be way worse :-(( I tried logging in to my account again and it seems it wasn't banned, but gallery-dl with the Netscape cookies.txt was (after a few downloads) redirecting me to the login page (as an error). I refreshed the cookies manually and it worked again. I looked through the docs but could not find any info about refreshing cookies.txt. Should I do something like this?
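
One possible way to avoid re-exporting cookies.txt by hand, offered as an untested assumption rather than a confirmed fix: gallery-dl's `cookies` option can take a browser name in a list instead of a file path, so cookies are read from the browser profile on each run. The sketch below builds such a fragment; `browser_cookie_config` is a hypothetical helper and `"firefox"` is only an example value.

```python
import json


def browser_cookie_config(browser="firefox"):
    """Return a config fragment that reads Instagram cookies from a browser.

    Assumes the named browser is installed and still logged in to Instagram.
    """
    return {
        "extractor": {
            "instagram": {
                # A list starting with a browser name means "load from browser"
                # rather than "load from a cookies.txt file".
                "cookies": [browser],
            }
        }
    }


cookie_config = browser_cookie_config()
print(json.dumps(cookie_config, indent=4))
```

The command-line counterpart is `--cookies-from-browser firefox`. This only keeps the cookies current; it would not prevent Instagram from invalidating the session in the first place.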

dademiller360 avatar Feb 20 '24 12:02 dademiller360

@ashes-xda @dademiller360

Did either of you find a solution? I'm using the following settings, but they're not helping:

"sleep-request": [30.0, 60.0],
"sleep-429": [60.0, 90.0],
"sleep": [30.0, 60.0],
"sleep-extractor": [30.0, 60.0],

nothing2obvi avatar Aug 05 '24 17:08 nothing2obvi