Create smaller selections

Open Popolechien opened this issue 2 years ago • 5 comments

The Gutenberg ZIM currently stands at 70GB in English, and users have commented that it is rather unwieldy.

It would be interesting to offer fiction / non-fiction subsets, or anything based on the project's bookshelves.

Popolechien avatar May 11 '23 06:05 Popolechien

What would be the input? A list of bookshelf IDs to include?

rgaudin avatar May 11 '23 07:05 rgaudin

Would this be the easiest way to implement such an idea?

Alternatively, at this stage I don't think that the input should be left to users (or any curator), so maybe having the scraper automatically generate a ZIM for each bookshelf ID might be a less labour-intensive first step.
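
To make the per-bookshelf idea concrete, here is a minimal sketch of how books could be grouped into one selection per bookshelf. It assumes each book is available as a `(book_id, bookshelves)` pair; the function name `group_books_by_bookshelf` and the sample data are hypothetical and are not taken from the scraper's code.

```python
from collections import defaultdict


def group_books_by_bookshelf(books):
    """Map each bookshelf name to the sorted list of book IDs on it.

    `books` is an iterable of (book_id, bookshelves) pairs. This input
    shape is an assumption made for illustration only.
    """
    selections = defaultdict(list)
    for book_id, bookshelves in books:
        for shelf in bookshelves:
            selections[shelf].append(book_id)
    return {shelf: sorted(ids) for shelf, ids in selections.items()}


# Made-up sample data, for illustration only.
sample_books = [
    (11, ["Children's Literature", "Fantasy"]),
    (84, ["Science Fiction", "Gothic Fiction"]),
    (35, ["Science Fiction"]),
    (2701, []),  # a book that sits on no bookshelf
]

selections = group_books_by_bookshelf(sample_books)
for shelf, ids in sorted(selections.items()):
    print(f"{shelf}: {len(ids)} book(s) -> {ids}")
```

Each key in `selections` would then correspond to one generated ZIM. Note that a book with no bookshelf ends up in no selection, so the full ZIM would still be needed to cover everything.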

Popolechien avatar May 11 '23 07:05 Popolechien

I see 👍 probably a good first step

rgaudin avatar May 11 '23 07:05 rgaudin

The PG bookshelves are currently not maintained; they used to be maintained on a wiki that got shut down because the underlying wiki software had security issues. Re-enabling the bookshelf management is a project that was worked on a year ago but didn't reach the finish line. So it might be a good idea to wait on this.

eshellman avatar May 11 '23 14:05 eshellman

@eshellman Thank you for this important feedback. I guess the problem is mostly not technical. Maybe this can be done somewhere in a dedicated code repository? For example, GitHub has a small wiki engine, and a wiki is available for each repository. Anyway, it sounds problematic to implement this feature if the PG bookshelves are not maintained!

kelson42 avatar Aug 18 '23 02:08 kelson42