project-kb icon indicating copy to clipboard operation
project-kb copied to clipboard

Improve products gazetteer

Open copernico opened this issue 4 years ago • 0 comments

It would be useful to grow our existing (small) gazetteer of product names so that we can improve our ability to automatically detect project names (and as a consequence, to automatically provide the user with the right git repository, with no need for them to indicate it manually).

The existing gazetteer is in datamodel/gazetteers/products.jsonl in #260 (more precisely here: https://github.com/SAP/project-kb/blob/cc7e0540c135adf4c22d79766f36be2de8ce94cd/prospector/datamodel/gazetteers/products.jsonl), it can be used as an example.

How to do this:

  • get all Apache foundation project names (e.g. using a script) from their GH organization page https://github.com/apache
  • same as above for Eclipse

copernico avatar Sep 21 '21 06:09 copernico