egeria-connector-ibm-information-server

findEntitiesByProperty returns incorrect results

Open marius-patrascu opened this issue 3 years ago • 3 comments

While executing the initial load, we noticed that not all of the GlossaryTerms are retrieved from IGC for processing. We are trying to retrieve 3000+ terms and maxPageSize is set to 1000, so the results are retrieved in pages.

The IGC search endpoint does not behave as expected. The first page is sorted by _id, but when the following pages are retrieved using the next-page URL provided in the IGC response body, the results are no longer sorted. As a result, some terms are missing from the overall result set and others are duplicated.

I modified the connector code in my local environment to not use any sorting property for the first page, and with that change all the GlossaryTerms are retrieved. The next-page URL contains the following query parameter: sorts : SearchSort{property='_id', ascending=true}, which seems to have no effect. I manually tried the queries using sorts : [{"property": "_id","ascending": true}] instead, and this sorts the results correctly.
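To illustrate the difference between the two forms of the sorts parameter, here is a minimal Java sketch. The SearchSort class below is a hypothetical stand-in written for this example, not the connector's actual class; only its toString() output is taken from the next-page URL quoted above. It shows how concatenating the object into a query string yields the toString() form, while an explicit JSON rendering yields the array form that sorted correctly in the manual test.

```java
import java.util.List;
import java.util.stream.Collectors;

class SortParamSketch {

    // Hypothetical stand-in for the sort criterion; the real connector class may differ.
    static final class SearchSort {
        private final String property;
        private final boolean ascending;

        SearchSort(String property, boolean ascending) {
            this.property = property;
            this.ascending = ascending;
        }

        // Debug-style rendering: this is the form seen in the next-page URL.
        @Override
        public String toString() {
            return "SearchSort{property='" + property + "', ascending=" + ascending + "}";
        }

        // JSON rendering of a single criterion.
        // Sketch only: assumes the property name needs no JSON escaping.
        String toJson() {
            return "{\"property\":\"" + property + "\",\"ascending\":" + ascending + "}";
        }
    }

    // Render all criteria as a JSON array, the form that worked when tried manually.
    static String toJsonArray(List<SearchSort> sorts) {
        return sorts.stream()
                .map(SearchSort::toJson)
                .collect(Collectors.joining(",", "[", "]"));
    }

    public static void main(String[] args) {
        List<SearchSort> sorts = List.of(new SearchSort("_id", true));
        System.out.println("toString form: " + sorts.get(0));
        System.out.println("JSON form:     " + toJsonArray(sorts));
    }
}
```

If the connector builds the sorts value by implicit string conversion, switching to an explicit JSON serialization along these lines might be one way to keep later pages sorted; whether that is where the problem actually originates is an assumption here.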

IGC info: Version 11.7.1SP2, Build 11.7.1SP2-dev_b164_cc72b37fee5a

If more info is needed, please let me know.

marius-patrascu, Dec 29 '20 13:12