gnparser
GNparser normalises scientific names and extracts their semantic elements.
`Psammophanes subgen. Psammophrynopsis Koch, 1953` parses fine, with a warning. `subgen. Psammophrynopsis Koch, 1953` doesn't get parsed at all. It would be good if names preceded by `subgen.` would get...
Make it easier to distinguish `candidatus` names by adding a field for them.
Given the following assumptions are always true:
* A string contains one or more of name, author, and year
* That string contains *only* those elements (i.e. there is never...
@Archilegt writes in https://github.com/gnames/gnparser/issues/201: One question: Are "wordType" values open to changes? For example: genus to [genericName](https://dwc.tdwg.org/terms/#dwc:genericName), species to [specificEpithet](https://dwc.tdwg.org/terms/#dwc:specificEpithet), infraspecies to [infraspecificEpithet](https://dwc.tdwg.org/terms/#dwc:infraspecificEpithet).
@abubelinha mentioned in https://github.com/gnames/gnverifier/issues/64: Currently, parsing of the `cf.` annotation works only for species (`Aus cf. bus`) but not for infraspecies (`Aus bus cf. cus`).
Recognizing "species group" or "species complex" suffixes as indicators of infrageneric groupings
created by @mtholder at https://gitlab.com/gogna/gnparser/-/issues/55 (First off: thanks for all the work on gnparser! It is shockingly efficient, precise, and complete.) This may well be out of scope, but I...
@abubelinha raised the following in #199: In summary, for the ö case, I think o is a much more conservative approach than oe (which looks like a germanic phonetic replacement,...
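To make the two options in this discussion concrete, here is a minimal Python sketch contrasting them. It is an illustration only and does not show what gnparser itself does: the function names, the `GERMANIC` table, and the sample name `Möller` are all hypothetical.

```python
import unicodedata

# Hypothetical table for the "Germanic" style of replacement (assumption,
# not taken from gnparser).
GERMANIC = {"ä": "ae", "ö": "oe", "ü": "ue", "Ä": "Ae", "Ö": "Oe", "Ü": "Ue"}


def strip_diacritics(text: str) -> str:
    """Conservative option: drop combining marks, so 'ö' becomes 'o'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def germanic_transliteration(text: str) -> str:
    """Phonetic option: expand umlauts, so 'ö' becomes 'oe'."""
    # Compose first so that 'o' + combining diaeresis is seen as one 'ö'.
    composed = unicodedata.normalize("NFC", text)
    return "".join(GERMANIC.get(ch, ch) for ch in composed)


if __name__ == "__main__":
    sample = "Möller"  # hypothetical author name
    print(strip_diacritics(sample))
    print(germanic_transliteration(sample))
```

The first function changes only the accented letter, which is why it can be seen as the more conservative choice; the second lengthens the string and assumes a German reading of the diacritic.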
We should try to parse names like
```
Acalypha australis L.var.genuina Nakai
Viburnum plicatum Thunb. var.tomentosum Miq.
Hermbstaedtia odorata var.cf.odorata
Skimmia japonica var.intermedia f.repens
```
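One way to approach names like these is a pre-processing pass that restores the missing spaces around rank and annotation markers before parsing. The Python sketch below is only an illustration under stated assumptions: the `MARKERS` list and the `space_out_markers` function are hypothetical, the marker list is far from complete, and this is not how gnparser handles such names.

```python
import re

# Hypothetical, deliberately short list of markers (assumption).
MARKERS = ("var", "subsp", "f", "cf")
_ALT = "|".join(MARKERS)

# Marker glued to a preceding abbreviation: "L.var." -> "L. var."
_BEFORE = re.compile(rf"(?<=\.)({_ALT})\.")
# Marker glued to the following lowercase word: "var.genuina" -> "var. genuina"
_AFTER = re.compile(rf"\b({_ALT})\.(?=[a-z])")


def space_out_markers(name: str) -> str:
    """Insert the spaces that are missing around known markers."""
    name = _BEFORE.sub(r" \1.", name)
    name = _AFTER.sub(r"\1. ", name)
    return name


if __name__ == "__main__":
    for raw in (
        "Acalypha australis L.var.genuina Nakai",
        "Viburnum plicatum Thunb. var.tomentosum Miq.",
        "Hermbstaedtia odorata var.cf.odorata",
        "Skimmia japonica var.intermedia f.repens",
    ):
        print(space_out_markers(raw))
```

A known limitation of this sketch is that it cannot tell a rank marker from an author abbreviation: `L.f.` (Linnaeus filius) would be rewritten to `L. f.`, so a real implementation would need context that a plain regular expression does not have.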
It should increase performance by another 5-10%, and it would also remove the dependency on ragel.