ripme icon indicating copy to clipboard operation
ripme copied to clipboard

Downloading subs on sites like Reddit dumping too many folders

Open Likander opened this issue 8 months ago • 6 comments

When downloading subs from sites like Reddit there are a LOT of posts with 2-3 images that get their own folder. When the rip is done instead of having a bunch of images I can arrange by date, I have 50 different folders with 2-3 images I have to slog through. The old ripper ignored this and just downloaded everything into 1 folder. How do I get the new one to repeat this behavior?

Likander avatar Mar 29 '25 13:03 Likander

When downloading subs from sites like Reddit there are a LOT of posts with 2-3 images that get their own folder. When the rip is done instead of having a bunch of images I can arrange by date, I have 50 different folders with 2-3 images I have to slog through. The old ripper ignored this and just downloaded everything into 1 folder. How do I get the new one to repeat this behavior?

  1. Find the rip.properties file:

    • Windows: %LOCALAPPDATA%\ripme\rip.properties
    • Linux: ~/.config/ripme/rip.properties
    • macOS: ~/Library/Application Support/ripme/rip.properties
  2. Open it in a text editor and add this line:

    reddit.save_by_subreddit=true

  3. Save the file and restart RipMe.


Now RipMe will download everything into a single folder per subreddit, instead of a new folder for each post.

Vaibhaverma-1 avatar Apr 06 '25 04:04 Vaibhaverma-1

@soloturn

Hi! I’d love to work on this issue if it’s still available. Could you please assign it to me? Thanks!

Vaibhaverma-1 avatar Apr 06 '25 04:04 Vaibhaverma-1

I tried the fix but it didn't work. I primarily use the .jar executable file btw if that makes any difference. If I knew how to assign issues I would.

Likander avatar Apr 06 '25 20:04 Likander

The old ripper ignored this and just downloaded everything into 1 folder. How do I get the new one to repeat this behavior?

Must be a really old version because this behavior is there since 1.7 (>4 years ago).

revanwolf avatar Apr 09 '25 20:04 revanwolf

@Vaibhaverma-1 I assigned to you. You can feel free to work on things without the issue being explicitly assigned to you though

metaprime avatar Apr 18 '25 04:04 metaprime

In general I'd like to change how filenames get created for reddit. Subfolders probably seemed like a good idea to someone at some point but they should include the post ID instead of just the name, for one thing, and I really think the date of the post should be included in the filename for reliable sortability by name. In most cases image number isn't needed, but maybe makes sense for downloading albums attached to posts. If done the right way with a naming scheme, these could be sorted by date/time and by post as well.

Something like YYYY-MM-dd_HHmmss_postID_title.gif YYYY-MM-dd_HHmmss_postID_title_01.jpg (album 1st image) YYYY-MM-dd_HHmmss_postID_title_02.jpg (album 2nd image)

Instead of .\title\othersitegid_01_imageid.jpg where title can be the same across multiple posts and now I have this garbage to sort out:

Image

And especially for users that do a lot of self-promoting, the same images tend to get posted again and again... so having functionality in Ripme to hash the data and say "we already have that image" during the rip itself would be really useful. Personally I'd like to keep the first instance chronologically and delete the one with a newer timestamp.

I've had these ideas kicking around in my head for years and have never brought myself to take the time to implement.

For the deduplication bit, I just run a separate simple tool after the download to detect and delete duplicates by their hashes, which is mildly annoying (but does the job) and loses context that Ripme has.

metaprime avatar Apr 18 '25 05:04 metaprime