inspire-next icon indicating copy to clipboard operation
inspire-next copied to clipboard

HoldingPen: Workflow: online first

Open ksachs opened this issue 6 years ago • 3 comments

Dealing with online first articles (DOI but no full pubnote): We don't want to curate / manually merge stuff twice. E.g. Elsevier sends an update for every step. 2 possibilities:

  • auto-reject (without blocking the following full article). This is what we are currently doing. It is possible to filter these records at DESY before sending xml to labs.
  • normal selection + ingest or auto-merge. I.e. do everything that can be done automatically and forget about information that would cause conflicts. Curation should be triggered only for record with full pubnote. The following versions would be matched automatically via DOI, so we don't have to do that again

ksachs avatar Jun 18 '18 13:06 ksachs

Micha: If you can filter them out easily, that would be easier. As this kind of things is probably publisher/journal dependent, it's something that we probably want to handle in hepcrawl (when we have non-DESY crawlers), not in the workflow.

ksachs avatar Jun 18 '18 13:06 ksachs

Kirsten will discuss this internally in DESY

StellaCh avatar Jun 20 '18 12:06 StellaCh

will filter them out before sending xml to labs. Has to be solved for non-DESY spiders.

ksachs avatar Jun 22 '18 08:06 ksachs