
De-duplication investigation

Open lumjjb opened this issue 2 years ago • 0 comments

Across many SBOM use cases, we're noticing that the ingested data is mostly the same from one SBOM to the next. We therefore want to make sure that we scale well and that data cost does not explode when ingesting the same SBOMs or slight variations of them.

Aside from #321, much more can probably be done on the backend, including normalizing Collector/Source information to help with deduplication and applying optimizations at the database backend level.
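
To make the Collector/Source normalization idea concrete, here is a minimal sketch in Python. It is not GUAC's implementation: the `collector` and `source` field names, the `content_key` helper, and the `DedupStore` class are all hypothetical, and the document is assumed to be a flat JSON-like dict. The idea is to hash only the content that is stable across ingestions and to record provenance separately, so the same SBOM arriving through two collectors is stored once.

```python
import hashlib
import json

# Hypothetical field names: keys assumed to carry collector/source
# provenance rather than SBOM content.
VOLATILE_KEYS = {"collector", "source"}


def strip_provenance(doc: dict) -> dict:
    """Return a copy of the document without the provenance fields."""
    return {k: v for k, v in doc.items() if k not in VOLATILE_KEYS}


def content_key(doc: dict) -> str:
    """Hash the document after dropping provenance and fixing key order."""
    canonical = json.dumps(
        strip_provenance(doc), sort_keys=True, separators=(",", ":")
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class DedupStore:
    """In-memory stand-in for a backend that stores each document once."""

    def __init__(self) -> None:
        self.docs: dict[str, dict] = {}
        self.provenance: dict[str, set] = {}

    def ingest(self, doc: dict) -> bool:
        """Store the document; return True only if its content was new."""
        key = content_key(doc)
        is_new = key not in self.docs
        if is_new:
            self.docs[key] = strip_provenance(doc)
        # Provenance is always recorded, even for duplicate content.
        self.provenance.setdefault(key, set()).add(
            (doc.get("collector"), doc.get("source"))
        )
        return is_new
```

This only catches documents that are identical once provenance is removed. Handling "slight variations" of an SBOM would probably need the same keying applied at a finer granularity, for example per package or per dependency edge, which this sketch does not attempt.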

lumjjb, Jun 22 '23 18:06