
Add feature to indicate if a commit has a "twin"

Open copernico opened this issue 4 years ago • 2 comments

"Important" commits are commonly back-ported to other branches. Security fixes in particular are often replicated across several branches, so detecting whether a commit has such a "twin" can help identify them.

Open questions:

  • since twin commits might not be identical (the code can differ across branches), we need a similarity measure that tolerates small differences
  • the similarity measure must be efficient to compute
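
To make the first question concrete, here is a minimal sketch of one possible tolerant measure: the Jaccard similarity between the sets of added and removed lines of two unified diffs. The function names `changed_lines` and `jaccard` are hypothetical and not part of project-kb. The sketch ignores hunk positions and surrounding whitespace, so the same change applied at a different offset on another branch still scores high.

```python
def changed_lines(diff_text: str) -> set:
    """Collect the added/removed lines of a unified diff as a set.

    Simplification (assumption): any line starting with '+++' or '---' is
    treated as a file header, so a removed line whose content begins with
    '--' would be skipped as well.
    """
    lines = set()
    for line in diff_text.splitlines():
        if line.startswith(("+++", "---")):
            continue  # file header lines
        if line.startswith(("+", "-")):
            content = line[1:].strip()
            if content:
                # Keep the sign so that an addition and a removal differ.
                lines.add(line[0] + content)
    return lines


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity |a & b| / |a | b|; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


diff_main = "--- a/f.py\n+++ b/f.py\n@@ -1,2 +1,2 @@\n-x = 1\n+x = 2\n+y = 3\n"
diff_backport = "--- a/f.py\n+++ b/f.py\n@@ -10,2 +10,2 @@\n-x = 1\n+x = 2\n+z = 4\n"
score = jaccard(changed_lines(diff_main), changed_lines(diff_backport))
```

Two of the four distinct changed lines are shared here, so `score` is 0.5. Comparing every pair of commits this way is quadratic in the number of commits, which is where the efficiency question comes in.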

copernico, May 04 '21 19:05

Could Chapter 3 of Ullman's MMDS (on finding similar items) be what we need?

http://infolab.stanford.edu/~ullman/mmds/book.pdf
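
For reference, that chapter covers shingling, minhashing, and locality-sensitive hashing. Below is a rough sketch of the minhashing idea only, assuming each commit has already been reduced to a set of string tokens (for example its changed lines). The helper names and the choice of `blake2b` as the hash family are assumptions made for illustration. The fraction of positions where two signatures agree estimates the Jaccard similarity of the underlying sets.

```python
import hashlib


def _hash(token: str, seed: int) -> int:
    """Deterministic 64-bit hash of a token; the seed selects the hash function."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(tokens, num_hashes: int = 128) -> list:
    """For each of num_hashes seeded hash functions, keep the minimum hash value."""
    token_set = set(tokens)
    if not token_set:
        raise ValueError("cannot compute a signature for an empty token set")
    return [min(_hash(t, seed) for t in token_set) for seed in range(num_hashes)]


def estimate_similarity(sig_a: list, sig_b: list) -> float:
    """Fraction of agreeing positions, an estimate of the Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


sig_1 = minhash_signature({"-x = 1", "+x = 2", "+y = 3"})
sig_2 = minhash_signature({"-x = 1", "+x = 2", "+z = 4"})
estimate = estimate_similarity(sig_1, sig_2)
```

Signatures have a fixed size regardless of how large a commit is, so they can be computed once per commit and stored. Avoiding the comparison of all pairs would additionally need the banding (LSH) step described in the same chapter, which this sketch does not implement.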

copernico, May 06 '21 12:05

Or maybe a less sophisticated approach: just check whether two commit messages are the same. Not perfect, but useful, and it can be replaced with a more advanced method later on.
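
A minimal sketch of that message-based check, assuming commits are available as `(commit_id, message)` pairs. `normalize_message` and `find_twins` are hypothetical names, not existing project-kb functions. The normalization drops the `(cherry picked from commit ...)` trailer that `git cherry-pick -x` appends, so a back-port created that way still matches its original.

```python
from collections import defaultdict


def normalize_message(message: str) -> str:
    """Lowercase, collapse whitespace, and drop the cherry-pick trailer line."""
    lines = [line.strip() for line in message.strip().splitlines()]
    lines = [
        line for line in lines
        if not line.lower().startswith("(cherry picked from commit")
    ]
    return " ".join(" ".join(lines).split()).lower()


def find_twins(commits) -> dict:
    """Map each commit id to the ids of other commits with the same normalized message."""
    groups = defaultdict(list)
    for commit_id, message in commits:
        key = normalize_message(message)
        if key:  # skip empty messages, which would otherwise all match each other
            groups[key].append(commit_id)
    twins = {}
    for ids in groups.values():
        if len(ids) > 1:
            for commit_id in ids:
                twins[commit_id] = [other for other in ids if other != commit_id]
    return twins


commits = [
    ("a1", "Fix XSS in login form\n"),
    ("b2", "Fix XSS in login form\n\n(cherry picked from commit a1)"),
    ("c3", "Update docs"),
]
twins = find_twins(commits)
```

Here `a1` and `b2` are reported as twins of each other and `c3` has none. Grouping by a dictionary key takes a single pass over the commits, which keeps the check cheap, but generic messages such as "Fix typo" would produce false positives.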

copernico, Jun 17 '21 06:06