openlibrary icon indicating copy to clipboard operation
openlibrary copied to clipboard

Importing books based BWB cover zips

Open scottbarnes opened this issue 1 year ago • 0 comments

Problem

In #9815 it was determined that BWBCoverBot is doing a fantastic job and we were wrong to ever doubt it. As such, it is faithfully importing covers, and the reason so few covers were imported when it was run across all of the BWB cover zips is that a great many of the covers are not associated with an ISBN that's recorded in Open Library.

Context

No response

Breakdown

Requirements Checklist

If we wish to make use of these covers, we should:

  • [ ] Take a sample of say 50 random covers from a few zips, look over them, and see if they subjectively look 'useful'. This is because in a limit skim, a number were independently published, or simply didn't have metadata available via our existing metadata sources.
  • [ ] Verify that, if they look useful, we have metadata sources with sufficiently complete and accurate metadata.
  • [ ] If they still look useful, import a sample, and see how things ultimately look and determine whether we need to change the import process to be more accurate. The reason for this is that it could in theory import ~4.7 million books, so getting it wrong would be not great.

Related files

Stakeholders

  • @mekarpeles

Instructions for Contributors

  • Please run these commands to ensure your repository is up to date before creating a new branch to work on this issue and each time after pushing code to Github, because the pre-commit bot may add commits to your PRs upstream.

scottbarnes avatar Oct 02 '24 02:10 scottbarnes