sdk icon indicating copy to clipboard operation
sdk copied to clipboard

Consider optimizations for DuplicateKeyTracker memory utilization

Open aiwilliams opened this issue 4 years ago • 0 comments

Integrations that process millions of entities and relationships and have rather large _key values can run out of memory keeping the complete set of values in the DuplicateKeyTracker. Also, there is a large impact on memory due to tracking the DuplicateKeyTrackerGraphObjectMetadata for each value, which becomes the sum of the bytes to define that object, the reference to it, and the references it has to the _key and _type strings.

This is not a problem for most integrations which process small sets of configuration information. Any optimization should be opt in, particularly if it comes at a cost of losing any features those integrations need which are supported by keeping this information around.

aiwilliams avatar Dec 30 '20 22:12 aiwilliams