Define system for keeping track of what has already been uploaded
Kinds of uploads
- Full text
- Images and other files
- Metadata
It may also be important to signal whether the upload was made by one of our tools or by someone else (especially if that upload predates our own attempt; see also #8).
What I have in mind here is basically a list (ideally cross-wiki) of the most cited DOIs, along with standardized indications of:
- what their license is
- whether the full text is on Wikisource
- whether the images are on Commons
- whether the metadata is on Wikidata (and synced with CrossRef), where we could also keep track of citations from Wikimedia projects, and of changes from CrossRef (e.g. after a correction or retraction)
- whether any other materials are on Wikimedia projects (e.g. quotes on Wikiquote)
- whether any of the uploads failed (e.g. due to file size limits, conversion issues), along with a link to the relevant bug
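To make the list above a bit more concrete, here is a minimal sketch of what one entry per DOI could look like. Everything in it is an assumption for discussion rather than an agreed design: the `UploadRecord` and `Status` names, the field names, and the example DOI `10.1234/example` are all hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from enum import Enum
from typing import Optional


class Status(str, Enum):
    """Hypothetical status values; distinguishes our uploads from others'."""
    NOT_ATTEMPTED = "not_attempted"
    UPLOADED_BY_US = "uploaded_by_us"
    UPLOADED_BY_OTHERS = "uploaded_by_others"
    FAILED = "failed"


# The three kinds of uploads named in the issue.
KINDS = ("full_text", "images", "metadata")


@dataclass
class UploadRecord:
    """One tracking entry per DOI (field names are illustrative only)."""
    doi: str
    license: Optional[str] = None
    full_text: Status = Status.NOT_ATTEMPTED   # Wikisource
    images: Status = Status.NOT_ATTEMPTED      # Commons
    metadata: Status = Status.NOT_ATTEMPTED    # Wikidata
    other: dict = field(default_factory=dict)  # e.g. {"wikiquote": "..."}
    failures: list = field(default_factory=list)

    def record_failure(self, kind, reason, bug_url=None):
        """Mark one kind of upload as failed and keep the reason and bug link."""
        if kind not in KINDS:
            raise ValueError(f"unknown upload kind: {kind}")
        setattr(self, kind, Status.FAILED)
        self.failures.append({"kind": kind, "reason": reason, "bug": bug_url})

    def to_json(self):
        """Serialize the record so it can be posted on wiki or stored elsewhere."""
        return json.dumps(asdict(self), sort_keys=True)


# Usage with a made-up DOI:
record = UploadRecord(doi="10.1234/example")
record.images = Status.UPLOADED_BY_OTHERS
record.record_failure("full_text", "conversion issue")
print(record.to_json())
```

Keeping the "by us" versus "by others" distinction in the status value itself, rather than in a separate flag, is just one option; a separate uploader field would work as well.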
Putting on the agenda (#74) for today's meeting.
Yeah, basically the options that look OK are using BEACON or some sort of MongoDB serializer.
Should be discussed at our next meeting - adding #74.
The conclusion of this was broadly to:
Log everything that happens in the code. Make those logs publicly available (either on wiki or through another web service). That leaves us with a signal-to-noise filtering problem, which is easier to deal with.
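A rough sketch of the "log everything, filter later" approach, assuming one JSON object per log line so the published logs stay machine-readable. The logger name `oa_upload_sketch`, the `doi` and `target` fields, and the `failures_only` helper are hypothetical and not taken from the existing code.

```python
import io
import json
import logging


class JsonLineFormatter(logging.Formatter):
    """Format each log record as a single JSON line."""

    def format(self, record):
        entry = {
            "level": record.levelname,
            "message": record.getMessage(),
            # Optional context passed via `extra=`; None if not supplied.
            "doi": getattr(record, "doi", None),
            "target": getattr(record, "target", None),
        }
        return json.dumps(entry, sort_keys=True)


def make_logger(stream):
    """Return a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger("oa_upload_sketch")
    logger.setLevel(logging.DEBUG)
    logger.handlers.clear()   # avoid duplicate handlers on repeated calls
    logger.propagate = False
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonLineFormatter())
    logger.addHandler(handler)
    return logger


def failures_only(lines):
    """Filter step: keep only warnings and errors from the full log."""
    for line in lines:
        entry = json.loads(line)
        if entry["level"] in ("WARNING", "ERROR", "CRITICAL"):
            yield entry


# Usage with an in-memory stream and a made-up DOI:
buffer = io.StringIO()
log = make_logger(buffer)
log.info("image uploaded", extra={"doi": "10.1234/example", "target": "commons"})
log.error("file too large", extra={"doi": "10.1234/example", "target": "commons"})
for entry in failures_only(buffer.getvalue().splitlines()):
    print(entry["doi"], entry["message"])
```

The same lines could be written to a file and pushed to a wiki page or a web service; the filter then runs over whatever was published.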
Max Klein ‽ http://notconfusing.com/
On Sat, Jun 21, 2014 at 3:17 PM, Daniel Mietchen wrote:
Should be discussed at our next meeting - adding #74 https://github.com/wpoa/OA-signalling/issues/74.