MMDownloader
Parsing fails for some manga
Situation
The site itself is reachable, but parsing fails with a connection timeout when downloading a manga.
Checklist
- [x] Password-protected manga - no
- [x] User-Agent blocking - no
- [x] Something like reCAPTCHA was added - no
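Cause (suspected)
The Marumaru site is served from two assigned IP addresses, and connections to one of them (104.23.140.16 in the trace) hit a socket timeout: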
```
LOCAL-[occidere:~/git/MMDownloader] curl -Iv http://wasabisyrup.com/archives/Ea1YsYUI-xc
* Trying 104.23.140.16...
* TCP_NODELAY set
* Connection failed
* connect to 104.23.140.16 port 80 failed: Operation timed out
* Trying 104.23.139.16...
* TCP_NODELAY set
* Connected to wasabisyrup.com (104.23.139.16) port 80 (#0)
> HEAD /archives/Ea1YsYUI-xc HTTP/1.1
> Host: wasabisyrup.com
> User-Agent: curl/7.54.0
> Accept: */*
>
< HTTP/1.1 200 OK
HTTP/1.1 200 OK
< Date: Sun, 18 Nov 2018 08:16:16 GMT
Date: Sun, 18 Nov 2018 08:16:16 GMT
< Content-Type: text/html; charset=utf-8
Content-Type: text/html; charset=utf-8
< Connection: keep-alive
Connection: keep-alive
< Set-Cookie: __cfduid=dd38bac0aea244a4a5207fd88687678f21542528976; expires=Mon, 18-Nov-19 08:16:16 GMT; path=/; domain=.wasabisyrup.com; HttpOnly
Set-Cookie: __cfduid=dd38bac0aea244a4a5207fd88687678f21542528976; expires=Mon, 18-Nov-19 08:16:16 GMT; path=/; domain=.wasabisyrup.com; HttpOnly
< Set-Cookie: PHPSESSID=20229b3c3e2881b1779add8e3e92176a; path=/
Set-Cookie: PHPSESSID=20229b3c3e2881b1779add8e3e92176a; path=/
< Expires: Thu, 19 Nov 1981 08:52:00 GMT
Expires: Thu, 19 Nov 1981 08:52:00 GMT
< Cache-Control: must-revalidate, no-cache, max-age=600
Cache-Control: must-revalidate, no-cache, max-age=600
< Pragma: public
Pragma: public
< X-Cache-Test: fresh
X-Cache-Test: fresh
< Accept-Ranges: bytes
Accept-Ranges: bytes
< X-Powered-By: CacheControl
X-Powered-By: CacheControl
< Referrer-Policy: same-origin
Referrer-Policy: same-origin
< X-Turbo-Charged-By: LiteSpeed
X-Turbo-Charged-By: LiteSpeed
< Server: cloudflare
Server: cloudflare
< CF-RAY: 47b8fe78d7fe98f5-LAX
CF-RAY: 47b8fe78d7fe98f5-LAX
<
* Connection #0 to host wasabisyrup.com left intact
```
Fix
When a connection attempt fails, retry against the other IP address. (How?)
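
One possible answer to the "How?", as a rough language-agnostic sketch rather than a patch for MMDownloader: resolve every address for the host, then try each one with a short connect timeout and keep the first that answers, which is what curl does in the trace. The sketch is in Python for brevity; `resolve_all`, `connect_with_fallback`, and the injectable `connector` parameter are hypothetical names introduced here, not part of the project.

```python
import socket


def resolve_all(host, port):
    """Return every distinct IP address the resolver reports for host."""
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    addresses = []
    for _family, _type, _proto, _canon, sockaddr in infos:
        ip = sockaddr[0]
        if ip not in addresses:
            addresses.append(ip)
    return addresses


def connect_with_fallback(addresses, port, timeout=3.0,
                          connector=socket.create_connection):
    """Try each address in order and return (ip, connection) for the
    first one that connects. `connector` is injectable (an assumption
    of this sketch) so the logic can be exercised without a network."""
    last_error = None
    for ip in addresses:
        try:
            return ip, connector((ip, port), timeout)
        except OSError as exc:  # socket.timeout is a subclass of OSError
            last_error = exc    # remember the failure, move to the next IP
    raise ConnectionError(f"all addresses failed: {addresses}") from last_error
```

Usage would look like `connect_with_fallback(resolve_all("wasabisyrup.com", 80), 80)`. Because the connection is opened by IP, the HTTP request sent over it still needs `Host: wasabisyrup.com`, as in the curl trace. A short per-address timeout matters here: with a single long overall timeout, the dead address can consume the whole budget before the healthy one is tried.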