inspire-next
HoldingPen: Workflow: online first
Dealing with online-first articles (DOI but no full pubnote): we don't want to curate or manually merge the same record twice. Elsevier, for example, sends an update for every step. Two possibilities:
- auto-reject (without blocking the full article that follows). This is what we are currently doing. It is possible to filter these records at DESY before sending XML to labs.
- normal selection + ingest or auto-merge, i.e. do everything that can be done automatically and drop any information that would cause conflicts. Curation should be triggered only for records with a full pubnote. Later versions would be matched automatically via DOI, so we don't have to do that again.
Micha: If you can filter them out easily, that would be easier. As this kind of thing is probably publisher/journal dependent, it's something that we probably want to handle in hepcrawl (when we have non-DESY crawlers), not in the workflow.
Kirsten will discuss this internally at DESY.
These records will be filtered out at DESY before the XML is sent to labs. This still has to be solved for non-DESY spiders.
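For the non-DESY case, the "DOI but no full pubnote" check could be sketched roughly as below. This is only an illustration, not existing hepcrawl or workflow code: the function names `is_online_first` and `filter_online_first` are hypothetical, the record is assumed to be a plain dict, and the field names (`dois`, `publication_info`, `journal_title`, `journal_volume`, `page_start`, `artid`) are assumptions about the record layout. What counts as a "full" pubnote is also an assumption and would probably need per-publisher tuning.

```python
def is_online_first(record):
    """Return True if the record has a DOI but no full pubnote.

    Assumption: a pubnote is "full" when it has a journal title, a volume,
    and either a start page or an article ID.
    """
    has_doi = any(doi.get("value") for doi in record.get("dois", []))
    has_full_pubnote = any(
        pub.get("journal_title")
        and pub.get("journal_volume")
        and (pub.get("page_start") or pub.get("artid"))
        for pub in record.get("publication_info", [])
    )
    return has_doi and not has_full_pubnote


def filter_online_first(records):
    """Drop online-first records; keep everything else untouched."""
    return [record for record in records if not is_online_first(record)]
```

Records without a DOI are deliberately kept, since the later full version could not be matched to them automatically anyway.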