project-kb
project-kb copied to clipboard
Improve products gazetteer
It would be useful to grow our existing (small) gazetteer of product names so that we can improve our ability to automatically detect project names (and as a consequence, to automatically provide the user with the right git repository, with no need for them to indicate it manually).
The existing gazetteer is in datamodel/gazetteers/products.jsonl in #260 (more precisely here: https://github.com/SAP/project-kb/blob/cc7e0540c135adf4c22d79766f36be2de8ce94cd/prospector/datamodel/gazetteers/products.jsonl), it can be used as an example.
How to do this:
- get all Apache foundation project names (e.g. using a script) from their GH organization page https://github.com/apache
- same as above for Eclipse