guac
De-duplication investigation
One thing we're noticing across many SBOM use cases is that the ingested data is largely the same from one SBOM to the next. We therefore want to make sure ingestion scales well and that storage costs do not balloon when the same SBOMs, or slight variations of them, are ingested repeatedly.
Aside from #321, much more can probably be done on the backend, including normalizing Collector/Source information to help with deduplication and applying optimizations at the database backend level.
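
To make the idea concrete, here is a rough sketch of one possible approach, not a description of how the backend works today. It assumes a simplified, hypothetical SBOM shape (a dict with a `packages` list) and an in-memory store; `canonical_hash` and `DedupStore` are illustrative names only. Collector/Source information is kept out of the content key, so the same SBOM arriving through two collectors is stored once, and slightly different SBOMs share their common package records.

```python
import hashlib
import json


def canonical_hash(value) -> str:
    """Hash a JSON-serializable value using a canonical encoding (sorted keys)."""
    encoded = json.dumps(value, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(encoded.encode("utf-8")).hexdigest()


class DedupStore:
    """Hypothetical in-memory store; a real backend would use database keys."""

    def __init__(self):
        self.packages = {}   # package key -> package record, stored once
        self.documents = {}  # document key -> sorted list of package keys
        self.seen_from = {}  # document key -> set of (collector, source)

    def _put_package(self, package: dict) -> str:
        key = canonical_hash(package)
        self.packages.setdefault(key, package)
        return key

    def ingest(self, sbom: dict, collector: str, source: str) -> str:
        # Assumed shape: sbom["packages"] is a list of plain dicts.
        pkg_keys = sorted(self._put_package(p) for p in sbom["packages"])
        # The document key depends only on content, not on where it came from.
        doc_key = canonical_hash(pkg_keys)
        self.documents.setdefault(doc_key, pkg_keys)
        # Collector/Source is recorded separately, as provenance.
        self.seen_from.setdefault(doc_key, set()).add((collector, source))
        return doc_key
```

With this layout, re-ingesting an identical SBOM only adds a provenance entry, and an SBOM that differs by one package adds one package record plus one small document record.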