owlsim-v3

co-annotation analytics

Open nlwashington opened this issue 9 years ago • 7 comments

in the old sim2 codebase, i took a crack at computing a co-annotation matrix using term frequency-inverse document frequency (TF-IDF) weighting. that code is in these methods:

computeTFIDFMatrix
getCoannotatedClassesForAttribute
getCoAnnotatedClassesForIndividual
getCoAnnotatedClassesForAttributes
getCoAnnotatedClassesForMatches
populateFullCoannotationMatrix
getSubsetCoannotationMatrix
initCoannotationMatrix

this needs to be ported from sim2 and refactored. it worked in my tests, but the performance was terrible once i scaled up to actual full-size data. i think the refactor will need to use a sparse matrix.
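to make the sparse-matrix idea concrete, here is a minimal sketch, not the sim2 code. everything in it is an assumption for illustration: the class name `SparseCoannotation`, the methods `computeTfIdf` and `topCoannotated`, the input shape (individual id mapped to its set of annotated class ids), and the weighting itself, where tf is the fraction of individuals annotated with class a that also carry class b, and idf is log(N / number of individuals annotated with b). the point is only that nested hash maps store the class pairs that actually co-occur, instead of a dense classes-by-classes array.

```java
import java.util.*;

// Hypothetical sketch: names and the exact TF-IDF weighting are assumptions,
// not the methods from sim2.
final class SparseCoannotation {

    /**
     * Builds a sparse co-annotation matrix: row class -> (column class -> score).
     * Pairs that never co-occur are simply absent (treated as zero).
     */
    static Map<String, Map<String, Double>> computeTfIdf(Map<String, Set<String>> annotations) {
        int n = annotations.size();
        Map<String, Integer> docFreq = new HashMap<>();        // individuals per class
        Map<String, Map<String, Integer>> co = new HashMap<>(); // co-occurrence counts

        for (Set<String> classes : annotations.values()) {
            for (String a : classes) {
                docFreq.merge(a, 1, Integer::sum);
                for (String b : classes) {
                    if (!a.equals(b)) {
                        co.computeIfAbsent(a, k -> new HashMap<>()).merge(b, 1, Integer::sum);
                    }
                }
            }
        }

        Map<String, Map<String, Double>> tfidf = new HashMap<>();
        for (Map.Entry<String, Map<String, Integer>> row : co.entrySet()) {
            String a = row.getKey();
            Map<String, Double> scores = new HashMap<>();
            for (Map.Entry<String, Integer> cell : row.getValue().entrySet()) {
                String b = cell.getKey();
                // assumed weighting: how often b appears alongside a,
                // discounted when b is annotated on many individuals overall
                double tf = cell.getValue() / (double) docFreq.get(a);
                double idf = Math.log((double) n / docFreq.get(b));
                scores.put(b, tf * idf);
            }
            tfidf.put(a, scores);
        }
        return tfidf;
    }

    /** Returns up to k classes with the highest score in the row for cls. */
    static List<String> topCoannotated(Map<String, Map<String, Double>> matrix, String cls, int k) {
        List<Map.Entry<String, Double>> cells =
                new ArrayList<>(matrix.getOrDefault(cls, Collections.emptyMap()).entrySet());
        cells.sort(Map.Entry.<String, Double>comparingByValue().reversed());
        List<String> top = new ArrayList<>();
        for (int i = 0; i < Math.min(k, cells.size()); i++) {
            top.add(cells.get(i).getKey());
        }
        return top;
    }
}
```

memory here grows with the number of co-occurring pairs rather than with the square of the number of classes. the inner loop is still quadratic in the number of classes per individual, so individuals with very large annotation sets would need separate handling.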

once ported, these methods will provide the calls that services need to get commonly co-annotated classes.

nlwashington, Oct 08 '14 22:10