Tom Morris
Just discovered a hack which might help @seabelis - if you click the "Retry" button on the error page, it will refer you to the original URL. Problem solved! (Don't...
Oddly, https://openlibrary.org/books/OL16775111M.json returns a rev 14 JSON with the author info included, while https://openlibrary.org/books/OL16775111M.json?v=14 returns a different rev 14 JSON which excludes the author info, so the info is getting...
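One way to pin down exactly which fields differ between the two responses is to fetch both and compare them key by key. This is only a minimal sketch: `fetch_json` and `diff_top_level` are hypothetical helper names, not part of Open Library, and the sketch assumes both URLs return a JSON object.

```python
import json
from urllib.request import urlopen


def fetch_json(url: str) -> dict:
    """Fetch a URL and decode the body as JSON (performs a network request)."""
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)


def diff_top_level(a: dict, b: dict) -> dict:
    """Summarize how two JSON objects differ at the top level."""
    return {
        # Keys present in the first document but missing from the second
        "only_in_a": sorted(a.keys() - b.keys()),
        # Keys present in the second document but missing from the first
        "only_in_b": sorted(b.keys() - a.keys()),
        # Keys present in both whose values are not equal
        "changed": sorted(k for k in a.keys() & b.keys() if a[k] != b[k]),
    }
```

Calling `diff_top_level(fetch_json(url), fetch_json(url + "?v=14"))` with `url` set to the first address above should then show whether `authors` (or any other key) appears in only one of the two responses.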
> The original characters in this example are also not being imported, that would be a new feature, but there may be existing code for other fields to fetch 880...
> Although, since MARC 710 is "Added Entry-Corporate Name", it makes sense that this field should never be rearranged as a personal name... needs some thought. Actually, there...
I haven't investigated in depth, but something suspicious that catches my eye is that the $6 subfield isn't listed for any of these entries: https://github.com/internetarchive/openlibrary/blob/bf4bc9d8e9d4f3bda987215a310c959b23ca6d52/openlibrary/catalog/marc/parse.py#L580-L585 but in any case, it...
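For context on why `$6` matters: in MARC 21 the `$6` linkage subfield pairs a regular field with its 880 counterpart, using the form `TAG-NN`, optionally followed by `/script` and `/orientation`. The sketch below shows how such a value could be parsed and matched. It is not the code in `parse.py`. The names `Linkage`, `parse_linkage`, and `find_linked_880` are hypothetical, and so is the representation of fields as `(tag, {subfield_code: value})` pairs.

```python
import re
from typing import NamedTuple, Optional


class Linkage(NamedTuple):
    tag: str                # tag of the field this $6 points at, e.g. "880"
    occurrence: str         # two-digit occurrence number, "00" means unlinked
    script: Optional[str]   # script identification code, if present


# TAG-NN, optionally followed by /script and /orientation
_LINKAGE_RE = re.compile(r"^(\d{3})-(\d{2})(?:/([^/]+))?(?:/(.*))?$")


def parse_linkage(value: str) -> Optional[Linkage]:
    """Parse a $6 value such as '880-01' or '710-01/$1'."""
    m = _LINKAGE_RE.match(value.strip())
    if m is None:
        return None
    return Linkage(m.group(1), m.group(2), m.group(3))


def find_linked_880(fields, tag: str, six_value: str):
    """Return the subfields of the 880 that points back at (tag, occurrence)."""
    link = parse_linkage(six_value)
    if link is None or link.tag != "880" or link.occurrence == "00":
        return None
    for f_tag, subfields in fields:
        if f_tag != "880":
            continue
        back = parse_linkage(subfields.get("6", ""))
        if back and back.tag == tag and back.occurrence == link.occurrence:
            return subfields
    return None
```

If `$6` is never read for a given field, a lookup like this cannot be performed for it, which would be consistent with the original-script values not being picked up.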
#628 is related. It sounds like @hornc may have some ideas for where the problems are located, but if he doesn't get around to it first, I'll add this to...
> Maybe we've missed the point here. Why are author names in the index of works to begin with, rather than just author identifiers? Because that way they can be...
Page caching can be an issue, but the JSON Search API still shows the same stale author name 13 hours later (https://openlibrary.org/search.json?q=OL47061593M), and the search result has a timestamp of `last_modified_i:...
> Whether the assumptions that were present then are still valid today, or whether there are any performance issues, is still an open question that needs investigation. Hopefully @mekarpeles agrees...
I've been keeping an eye out for these and I'm pretty sure it's currently being done wrong, or at least sub-optimally. I'm not sure if different catalogers use different rules, but there...