scancode.io
scancode.io copied to clipboard
Better handle VCS checkouts and clones in a Docker image
Somehow we do not ignore .git directories in the docker pipeline That both good and bad:
- there is some (small) value to know we have a git checkout and that could be important origin clues
- these are generally ignored by scancode-toolkit with https://github.com/nexB/commoncode/blob/main/src/commoncode/ignore.py
In the current way, this ends up being noise.
Eventually we should have a better way to deal in general with ignore and "uninteresting" or "junk" files
This can be solved by having a .dockerignore file. I adopted this approach in vulnerablecode at https://github.com/nexB/vulnerablecode/pull/497/commits/743a6aa273553300d45c9ea8724aa12afa55d7f8