CoordinateCleaner icon indicating copy to clipboard operation
CoordinateCleaner copied to clipboard

Number of records detected: difference between the number for each test and the summary

Open luroy opened this issue 2 years ago • 0 comments

Dear all,

I’ve just started using CoordinateCleaner to flag and remove problematic records from the Gbif package. So I apologize in advance if my following comment is due to my unfamiliarity with this package.

From a given input (GBIF records of species occurrence), I ran the clean_coordinates function as follows:

flags<-clean_coordinates(gbif_data_df, lon = "decimalLongitude", lat = "decimalLatitude", countries = "countryCode2", species = "species", tests = c("capitals", "centroids", "duplicated", "equal", "gbif", "institutions", "outliers", "zeros"))

Here, I obtained the following results in my R console:

Testing coordinate validity Flagged 0 records. Testing equal lat/lon Flagged 0 records. Testing zero coordinates Flagged 0 records. Testing country capitals Flagged 10 records. Testing country centroids Flagged 3 records. Testing geographic outliers Flagged 44 records. Testing GBIF headquarters, flagging records around Copenhagen Flagged 0 records. Testing biodiversity institutions Flagged 0 records. Flagged 10 of 12187 records, EQ = 0

summary(flags) .val .equ .zer .cap .cen .otl .gbf .inst 0 0 0 10 3 0 0 0 .summary 10

If the 3 "centroids" flagged records seems to be comprised within the 10 "country capitals" records, I don't understand why the "geographic outliers" flagged records are not shown in the summary, nor in the "flags" object.

Thank you for your attention to this matter,

Léa

luroy avatar Aug 12 '21 16:08 luroy