Raymond Berger
How long will you have to wait to load all the images and compute the hash? 40k images at 2 seconds each would take less than a day, which...
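As a rough check of that estimate, here is a minimal sketch of the arithmetic. The function name `estimated_duration` and the `workers` parameter are illustrative assumptions, not anything from the actual pipeline; the 40k and 2-second figures are the ones quoted above.

```python
from datetime import timedelta


def estimated_duration(n_images: int, seconds_per_image: float, workers: int = 1) -> timedelta:
    """Back-of-the-envelope wall-clock time to process every image.

    `workers` is a hypothetical knob: it assumes the work splits evenly
    across parallel workers, which real I/O-bound jobs only approximate.
    """
    return timedelta(seconds=n_images * seconds_per_image / workers)


# 40,000 images * 2 s = 80,000 s, a little over 22 hours on one worker.
print(estimated_duration(40_000, 2))  # 22:13:20
```

Running it sequentially stays under 24 hours, so a single overnight-plus run would be enough under these assumptions.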
Please upload the list (of both matching and not matching) here as a CSV and then I'm sure Scott can provide more info on what to do next.
It might also be a good idea to prevent these images from being uploaded in the future by hardcoding that hash into the upload pipeline?
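A minimal sketch of what such a check could look like, assuming an exact content hash (SHA-256 here) and a hardcoded blocklist. `BLOCKED_HASHES`, `image_digest`, and `is_blocked` are hypothetical names, and the digest in the set is a placeholder computed from dummy bytes, not the real hash of the problem image.

```python
import hashlib

# Hypothetical blocklist of SHA-256 hex digests for known-bad images.
# The entry below is a placeholder derived from dummy bytes.
BLOCKED_HASHES = {
    hashlib.sha256(b"placeholder bad image bytes").hexdigest(),
}


def image_digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of the raw image bytes."""
    return hashlib.sha256(data).hexdigest()


def is_blocked(data: bytes, blocked: set[str] = BLOCKED_HASHES) -> bool:
    """True if the uploaded bytes exactly match a blocklisted image."""
    return image_digest(data) in blocked


# Usage: reject the upload before storing it.
upload = b"placeholder bad image bytes"
if is_blocked(upload):
    print("upload rejected: image is on the blocklist")
```

Note that an exact hash only catches byte-identical files; a resized or re-encoded copy of the same image would get a different digest and pass this check.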
@OutstandingWork I think you should be able to do it with the editions and works dumps. Also, using DuckDB (an example is on the dump page) you should be able...
@OutstandingWork are you still willing to work on this one?
@sbwhitt I think yours is a big improvement! At the same time, I think we should drastically simplify the design long term. Just have next/prev (and maybe first/last). For comparison...
@sbwhitt I assigned you. I know it'll be a slow next month or so as the team is away for the holidays but it would be awesome if you wanted...
https://github.com/internetarchive/openlibrary/pull/10069 is the example. Might have to put a little thinking into what the keys will be for each state. I think they probably should be relative like the words...
@sbwhitt FYI, this is for the last 30 days.
@sbwhitt are you still interested in working on this?