checklistbank
checklistbank copied to clipboard
field taxonID missing in WoRMS dataset
The field taxonID
is no longer present in the species/search API result, only for the WoRMS dataset (?)
This only happened in the last few months.
i.e. compare
https://api.gbif.org/v1/species/search?datasetKey=2d59e5db-57ad-41ff-97d6-11f5fb264527&offset=60000&limit=1
WoRMS, missing taxonID
vs.
https://api.gbif.org/v1/species/search?datasetKey=9ca92552-f23a-41a8-a140-01abaa31c931&offset=60000&limit=1
ITIS includes taxonID
If you lookup the species directly via the database it still has the taxonID: http://api.gbif.org/v1/species/154667321
This must have happened when we rebuilt the search index while deploying the new backbone last week. Why WoRMS and if there are other datasets involved with the same problem I can't tell right now.
Yes I know, but we want to get all of the nub keys for all WoRMS entries. Sending 1.2M requests is just not what we want...
Understood. This is a bug. Meanwhile you can page through the database like this, even though paging is a terrible thing for accessing all records. This is true for both a database and solr/elastic: http://api.gbif.org/v1/species?datasetKey=2d59e5db-57ad-41ff-97d6-11f5fb264527&limit=1000&offset=1000
I could probably easily expose a new service that returns just the mappings very quickly for all records of a dataset. Useful?
Definitely useful, since currently we need to apply voodoo magic to work around the 100k offset limitation :)
... the limits are there for good reasons :)
Any ETA on fixing the taxonID field or creating a mapping service?
The taxonID field seems back now. Excellent & thanks.
Problem is back: taxonID missing Reopening
@mdoering when will this be fixed? Now already 9 months without taxonID
@marcos-lg @fmendezh maybe you have an idea why the search is missing the taxonID field? It apparently was there before. The field is in NameUsageSearchResult, but not in NameUsageAvro if we use that to update ES?