scancode.io
scancode.io copied to clipboard
ScanCode.io: Analyze 1000 source and binary packages with d2d pipeline
I would like to analyze roughly 1000 source and binary packages with d2d pipeline to evaluate how it performs. Some packages to consider could include xz-utils/liblzma, Apache httrace, and a slice of popular packages.
We should have a report on this to integrate in the documentation and as a blog post, including any possible CVEs/discrepancy detected in the process.
The run of a 1000 packages will need either:
-
to be scripted based on a list of to and from and calling the CLI from SCIO to run them all
-
Or we first complete the integration in PurlDB and instead add all these packages there and call the API in PurlDB
- https://github.com/nexB/purldb/issues/373
Some ideas:
Debian Packages
- TBD
Popular packages in GitHub
Some packages in the news
- redis
- log4j
JavaScript on npm:
- version of sqlite driver https://www.npmjs.com/package/sqlite3 both ELF and JS
- Some JS with minified code TBD
Uberjars:
- htrace versions such as:
- https://repo1.maven.org/maven2/org/apache/htrace/htrace-core/4.0.0-incubating/htrace-core-4.0.0-incubating-sources.jar#from
- https://repo1.maven.org/maven2/org/apache/htrace/htrace-core/4.0.0-incubating/htrace-core-4.0.0-incubating.jar#to
I will start a spreadsheet with the list of packages