inspire icon indicating copy to clipboard operation
inspire copied to clipboard

Bibcheck: added regexp_replace rules to rules.cfg

Open cleggm1 opened this issue 7 years ago • 4 comments

Added 12 rules to normalize 65017a in HEPNames using regexp_replace.

cleggm1 avatar Dec 20 '17 20:12 cleggm1

@michamos any comment on this?

seems ok, if inefficient, to me

tsgit avatar Dec 21 '17 16:12 tsgit

/opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161260.log changed 4805 records to physics.acc-ph /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161266.log changed 377 records to physics.atom-ph /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161268.log changed 120 records to nlin.CD /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161269.log changed 307 records to physics.med-ph /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161273.log changed 73 records to physics.ao-ph /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161272.log changed 217 records to physics.plasma-ph /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161271.log changed 112 records to nucl-ex /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161270.log changed 80 records to nucl-ex /opt/cds-invenio/var/log/bibsched/116/bibsched_task_1161280.log changed 537 records to physics.comp-ph

tsgit avatar Dec 22 '17 19:12 tsgit

the commit message is not very informative Bibcheck: added regexp_replace rules to rules.cfg better would be: BibCheck: rules to normalize HepNames subjects for display in short log. Also, subtitles or further explanation should be indented 4 chars and started with a * for each bullet point or paragraph

tsgit avatar Dec 22 '17 19:12 tsgit

Those that correspond to renamings on arXiv, e.g. chao-dyn -> nlin.CD are already handled in dojson, but it doesn't hurt to convert them on legacy.

michamos avatar Jan 08 '18 10:01 michamos