gallery-dl icon indicating copy to clipboard operation
gallery-dl copied to clipboard

[Request] Site support for nsfw nudecollect dot com

Open junomarian opened this issue 2 years ago • 9 comments

Dear, dev! It would be great to add this site. They ask for premium membership for downloading whole imagesets. If you add nudecollect support in next update it would be so great. Thanks for developing such nice tool

NSFW Example gallery with 4 pages https://www.nudecollect dot com/content/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/index-mirror-11-65.html

junomarian avatar Mar 19 '22 12:03 junomarian

I also requested, how to rip galleries from nudecollect on vipergirls forum. Good dev, created a powershell script to scrape. https://vipergirls.to/threads/5819045-Automating-Downloads-using-Powershell?p=102974387&viewfull=1#post102974387

junomarian avatar Mar 23 '22 19:03 junomarian

I also requested, how to rip galleries from nudecollect on vipergirls forum. Good dev, created a powershell script to scrape. https://vipergirls.to/threads/5819045-Automating-Downloads-using-Powershell?p=102974387&viewfull=1#post102974387

Can't be accessed without account?

You could at least post the ps1 here, or put it into a gist or something.

Hrxn avatar Mar 24 '22 14:03 Hrxn

I also requested, how to rip galleries from nudecollect on vipergirls forum. Good dev, created a powershell script to scrape. https://vipergirls.to/threads/5819045-Automating-Downloads-using-Powershell?p=102974387&viewfull=1#post102974387

Can't be accessed without account?

You could at least post the ps1 here, or put it into a gist or something.

https://gist.github.com/junomarian/8d7f1ff6aae0b6cde961afab83dabd7a here you go sir.

junomarian avatar Mar 24 '22 14:03 junomarian

Thank you, sir. Although it should be mentioned, maybe, that this is not pure Powershell, as it depends on Selenium + ChromeDriver.

Hrxn avatar Mar 24 '22 15:03 Hrxn

Thank you, sir. Although it should be mentioned, maybe, that this is not pure Powershell, as it depends on Selenium + ChromeDriver.

So is this website out of the scope of gallery-dl? Is it possible to add support for it dev?

junomarian avatar Mar 24 '22 18:03 junomarian

This may not be possible to do directly with gallery-dl. When you click on a thumbnail on the gallery page, you get a wait message while it executes a script in the background that finally displays the full-size image embedded on the page which, when clicked, opens the image alone on the page. You can 'right-click' the image, embedded or not, and copy its url. The images themselves all have the same 'template' with only the image number changing for each image.

For example, clicking the first image of the first page of the given gallery, https://www.nudecollect.com/content/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/image-1-pics-65-mirror-11.html, displays: https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image00001-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

Clicking the last image of the last page of the given gallery, https://www.nudecollect.com/content/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/image-65-pics-65-mirror-11.html, displays: https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image00065-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

Notice that the only change is from image00001 to image00065.

So the template is:

https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image[00001-00065]-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

I couldn't figure out how to use the gallery-dl --range option, there are no real-world examples, but I could easily download a range of the full-size gallery images using curl's ranging capability (image[00001-00005]):

curl "https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image[00001-00005]-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg" -o "WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_#1.jpg" outputs the first 5 images renamed to WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_00001.jpg - WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_00005.jpg.

The images are obscenely large, 5304×7952!

The bottom line is that you can easily get the range of urls from the first and last images of a gallery and use curl.

PacoH avatar Aug 12 '22 18:08 PacoH

This may not be possible to do directly with gallery-dl. When you click on a thumbnail on the gallery page, you get a wait message while it executes a script in the background that finally displays the full-size image embedded on the page which, when clicked, opens the image alone on the page. You can 'right-click' the image, embedded or not, and copy its url. The images themselves all have the same 'template' with only the image number changing for each image.

Why do you think that? Because of this wait message script? On the other hand, if the filename template is always the same, in other words, is predictable, this should be doable ...

The images are obscenely large, 5304×7952!

You think that these may actually be originals?

Hrxn avatar Aug 13 '22 02:08 Hrxn

Yes. This landing spot has a script that handles everything and I don't know if there is a way to execute it remotely.

<
div style = "width:950px;position:relative;margin:auto;" > < script type = "text/javascript"
src = "https://www.nudecollect.com/adt3.php" > < /script></div >
<
script > (function() {
    var js = "window['__CF$cv$params']={r:'73a270b47cc57cd9',m:'WYZ1z10wN5Xzh8O7qRAV2jJzOSydrcoXYlPEHCDIKUI-1660404085-0-Ac4Tgyjy4tvcrKLLxI+d0+phUHKqrZWb7EgfeqpFTewmXpDJOhmG+rdRCRmBdaRCkYMBE7dRbDRAqVF4mveaSsTATuirdKvyuxZY/m+5M0FUNz/muKRY6/b7a0UEC2KINX6mPv2Z9eq4+fnnzMh3YZMcauP3kWHx4sizKXZNwRiYzHPoTLqd/mudtZtToup+Upc2igZplaNdW2wKQvD/unbJJgM6/8Hq7yk+f63LXn3Y',s:[0x9772580968,0x57ad8f7b4e],u:'/cdn-cgi/challenge-platform/h/g'};var now=Date.now()/1000,offset=14400,ts=''+(Math.floor(now)-Math.floor(now%offset)),_cpo=document.createElement('script');_cpo.nonce='',_cpo.src='/cdn-cgi/challenge-platform/h/g/scripts/alpha/invisible.js?ts='+ts,document.getElementsByTagName('head')[0].appendChild(_cpo);";
    var _0xh = document.createElement('iframe');
    _0xh.height = 1;
    _0xh.width = 1;
    _0xh.style.position = 'absolute';
    _0xh.style.top = 0;
    _0xh.style.left = 0;
    _0xh.style.border = 'none';
    _0xh.style.visibility = 'hidden';
    document.body.appendChild(_0xh);

    function handler() {
        var _0xi = _0xh.contentDocument || _0xh.contentWindow.document;
        if (_0xi) {
            var _0xj = _0xi.createElement('script');
            _0xj.nonce = '';
            _0xj.innerHTML = js;
            _0xi.getElementsByTagName('head')[0].appendChild(_0xj);
        }
    }
    if (document.readyState !== 'loading') {
        handler();
    } else if (window.addEventListener) {
        document.addEventListener('DOMContentLoaded', handler);
    } else {
        var prev = document.onreadystatechange || function() {};
        document.onreadystatechange = function(e) {
            prev(e);
            if (document.readyState !== 'loading') {
                document.onreadystatechange = prev;
                handler();
            }
        };
    }
})(); < /script>

But if you can get the url for one actual jpg and the number of images, that can be used to download the images.

I do think these images are originals but usually they are not this big.

PacoH avatar Aug 13 '22 16:08 PacoH

This may not be possible to do directly with gallery-dl. When you click on a thumbnail on the gallery page, you get a wait message while it executes a script in the background that finally displays the full-size image embedded on the page which, when clicked, opens the image alone on the page. You can 'right-click' the image, embedded or not, and copy its url. The images themselves all have the same 'template' with only the image number changing for each image.

For example, clicking the first image of the first page of the given gallery, https://www.nudecollect.com/content/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/image-1-pics-65-mirror-11.html, displays: https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image00001-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

Clicking the last image of the last page of the given gallery, https://www.nudecollect.com/content/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/image-65-pics-65-mirror-11.html, displays: https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image00065-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

Notice that the only change is from image00001 to image00065.

So the template is:

https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image[00001-00065]-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg

I couldn't figure out how to use the gallery-dl --range option, there are no real-world examples, but I could easily download a range of the full-size gallery images using curl's ranging capability (image[00001-00005]):

curl "https://mirror1.nudecollect.com/showimage/nudecollect-8769086487/image[00001-00005]-5896498214-11-9689595623/WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018/9879560327/nudecollect.com.jpg" -o "WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_#1.jpg" outputs the first 5 images renamed to WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_00001.jpg - WowGirls_Jia_Lissa_Taking_Charge_Dec_13_2018_00005.jpg.

The images are obscenely large, 5304×7952!

The bottom line is that you can easily get the range of urls from the first and last images of a gallery and use curl.

Will it be supported by gallery-dl in future? I have done as you mentioned above, its doable, but time consuming.

junomarian avatar Aug 19 '22 03:08 junomarian

FYI, just stumbled upon this, but it seems this site is gone? At least redirects to an entirely different domain. Only similarities are, uh, thematically.

Does not look like this new site is supposed to be a replacement of equal worth, seems sketchy to me, to be honest.

Edit

Pinging @enduser420

Hrxn avatar Sep 06 '23 00:09 Hrxn