Work dumps include authors without a key
Works in the works dump have authors without a key, that are not present in the api response.
Evidence / Screenshot (if possible)
Authors of OL24527121W in the work dump: "authors": [{"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239668A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239669A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239670A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239671A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239672A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239673A"}, "type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}, {"type": {"key": "/type/author_role"}}]
api response:
{"title": "Medicina: Aspectos Epidemiol\u00f3gicos, Cl\u00ednicos e Estrat\u00e9gicos de Tratamento", "authors": [{"author": {"key": "/authors/OL9239668A"}, "type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239669A"}, "type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239670A"}, "type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239671A"}, "type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239672A"}, "type": {"key": "/type/author_role"}}, {"author": {"key": "/authors/OL9239673A"}, "type": {"key": "/type/author_role"}}], "key": "/works/OL24527121W", "type": {"key": "/type/work"}, "latest_revision": 1, "revision": 1, "created": {"type": "/type/datetime", "value": "2021-05-25T16:10:52.471943"}, "last_modified": {"type": "/type/datetime", "value": "2021-05-25T16:10:52.471943"}}
Relevant url?
Steps to Reproduce
- Read thourgh the work dump
- Find that work in the work dump
- Observe that authors array
- Compare to the api response
Related files
script/oldump.sh script/oldump.py https://github.com/internetarchive/openlibrary/blob/ceca4ccca599ba4d2660d687937cdd85fc8b9a08/openlibrary/data/dump.py#L328
Stakeholders
@cdrini
Hmm 🤔 @RayBB would you by any chance be able to verify if the latest dumps still has this issue?
Still there in the July dump, which is the latest I have handy.
Ah ok, thank you @tfmorris so seems like still an issue.