
Philos. Trans. R. Soc., A abbreviation

Open thchr opened this issue 2 years ago • 1 comments

Consider the following horror show of a DOI:

julia> doi2bib("10.1098/rsta.1904.0024"; abbreviate=false)
@article{johndoe1904,
  doi = {10.1098/rsta.1904.0024},
  year = 1904,
  volume = {203},
  number = {359-371},
  pages = {385--420},
  title = {{XII}. Colours in metal glasses and in metallic films},
  journal = {Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character}
}

According to CASSI, the correct journal abbreviation is Philos. Trans. R. Soc., A. We currently return Philos. Trans. R. Soc. London. Ser. A, Contain. Pap. Math. Or Phys. Character. That is, we ought to discard everything from Series A onward, keeping only the series letter. It seems we also ought to remove London, but that goes beyond the ISO-4 rules. Part of the problem is that the journal name returned by the GET request contains more text than it should, but we still ought to do better.

A sub-part of the problem is also that we are abbreviating 'Series' to 'Ser.' rather than just discarding it.
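
A rough sketch of the truncation step described above, assuming the series designator always appears as the literal word "Series" followed by a single capital letter. `split_series` is a hypothetical helper for illustration only, not an existing DOI2BibTeX.jl function:

```julia
# Hypothetical helper (not part of DOI2BibTeX.jl): cut a journal name at a
# "Series X" designator and return the leading name and the series letter.
# Everything after the series letter (e.g. ", Containing Papers ...") is dropped.
function split_series(journal::AbstractString)
    # Lazily capture the name, skip an optional trailing '.' or ',',
    # then require "Series" followed by one capital letter at a word boundary.
    m = match(r"^(.*?)[.,]?\s+Series\s+([A-Z])\b", journal)
    m === nothing && return (String(journal), nothing)
    return (String(strip(m.captures[1])), String(m.captures[2]))
end

name, series = split_series("Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character")
```

The `name` part could then go through the usual abbreviation step, with `", " * series` appended afterwards, so that 'Series' is never abbreviated to 'Ser.' at all. Removing London would still need separate handling.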

thchr avatar Dec 14 '22 03:12 thchr

> A sub-part of the problem is also that we are abbreviating 'Series' to 'Ser.' rather than just discarding it.

Oddly, "series" is included in LTWA_ENTIREWORD. Maybe because it could be in reference to a mathematical series?

thchr avatar Mar 29 '23 15:03 thchr